MULTIVALENT STREPTOCOCCAL VACCINE
COMPOSITIONS AND METHODS FOR USE 



CROSS-REFERENCE TO RELATED APPLICATION 

This application claims the benefit of U.S. Provisional Patent Application
No. 60/348,434 filed October 26, 2001. This provisional application is incorporated herein
by reference in its entirety.

STATEMENT OF GOVERNMENT INTEREST 

This invention was made with government support under Grant No. AI-10085
awarded by the National Institutes of Health, and support from the Department of
Veterans Affairs, Merit Review funds. The government may have certain rights in this
invention.

BACKGROUND OF THE INVENTION 

Field of the Invention

The present invention relates generally to the prevention of infectious
disease, and more specifically, to compositions comprising multivalent hybrid polypeptides
having immunogenic M protein and Spa peptides capable of eliciting protective immunity.



Description of the Related Art 

Group A streptococcal pharyngitis is one of the most common bacterial
infections in school-age children. In addition to streptococcal pharyngitis, there may be
associated nonsuppurative sequelae, such as acute rheumatic fever (ARF). Although the
incidence of ARF has declined in developed countries, the disease remains rampant in
developing countries (Community prevention and control of cardiovascular diseases.
Report of a WHO expert committee. World Health Organization. 1986:732). Another
form of streptococcal infection is invasive, which afflicts thousands of children and adults
each year, often resulting in death or significant morbidity (Stevens, J. Infect. Dis. 2:S366,
1999). Efforts to develop a vaccine that would prevent group A streptococcal infections
have been ongoing for over eight decades (Dochez et al., J. Exp. Med. 30:179, 1919;
Lancefield, J. Exp. Med. 47:91, 1928).

Hence, a need exists for identifying and developing compositions
therapeutically effective against streptococcal infections, particularly those compositions
that can function as a vaccine and elicit protective immunity. Furthermore, there is a need
for vaccine formulations that can be varied to protect against or treat infection by
different streptococcal serotypes. The present invention meets such needs, and further
provides other related advantages.

SUMMARY OF THE INVENTION

The present invention provides the discovery of therapeutic formulations of
multivalent hybrid polypeptides, particularly a cocktail of hybrid polypeptides useful for
eliciting a protective immune response having broadly protective opsonic antibodies in the
absence of tissue cross-reactive antibodies. Hybrid polypeptides of the invention comprise
at least six different linked immunogenic peptides, wherein each peptide comprises an
amino-terminal portion of a streptococcal M protein of at least 30 amino acids, and wherein
the polypeptide has an amino-terminal peptide that is reiterated as a carboxy-terminal
peptide and the polypeptide is capable of eliciting an immune response against more than
one serotype of group A streptococci.

In one aspect, the invention provides a hybrid polypeptide, comprising at
least six different linked immunogenic peptides, wherein each peptide comprises an
amino-terminal portion of a streptococcal M protein of at least 30 amino acids, and wherein the
polypeptide has an amino-terminal peptide that is reiterated as a carboxy-terminal peptide
and the polypeptide is capable of eliciting an immune response against more than one
antigen of group A streptococci comprising at least M5, M6, M14, M19, M24, and M29.
In other embodiments, the polypeptide elicits an immune response against more than one
antigen of group A streptococci comprising at least M2, M11, M22, M33, M43, M59, and
M94, or at least M75, M76, M77, M89, M92, M101, and M114, or at least Spa, M1.0,
M1.2, M3, M12, M18, and M28, or against 27 or more antigens of group A streptococci.
In more embodiments, the amino-terminal immunogenic peptide of the hybrid polypeptide
is M24, M1, M2, or M89. In still other embodiments, the polypeptides are recombinant
and the immunogenic peptides are linked in tandem, the polypeptides having a structure of
M24-M5-M6-M19-M29-M14-M24, M2-M43-M94-M22-M11-M59-M33-M2,
M89-M101-M77-M114-M75-M76-M92-M89, or M1.0-M12-Spa-M28-M3-M1.2-M18-M1.0. In yet
other embodiments, the hybrid polypeptides comprise the amino acid sequence of
SEQ ID NOS:2, 4, 6, or 8. In other embodiments, the immunogenic peptides are linked by
at least two amino acids encoded by a nucleic acid sequence that is a restriction enzyme
recognition site, wherein the recognition site includes at least one of BamHI, ClaI, EcoRI,
HindIII, KpnI, NcoI, NheI, PmlI, PstI, SalI, and XhoI. In further embodiments, any of the
aforementioned hybrid polypeptides is capable of eliciting at least one opsonic antibody that
is not a tissue cross-reactive antibody in a subject, wherein the subject is a human or an
animal. In more embodiments, any of the aforementioned hybrid polypeptides further
comprises a carboxy-terminal tag, wherein the carboxy-terminal tag is selected from the
group consisting of alkaline phosphatase, β-galactosidase, hexahistidine, FLAG®,
XPRESS™, and GST. In still more embodiments, any of the aforementioned hybrid
polypeptides or fusion proteins further comprise at least one additional carboxy-terminal
amino acid, wherein the additional carboxy-terminal amino acid is a D-amino acid or
cysteine.
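The tandem layouts recited in this summary can be checked mechanically. The following Python sketch is illustrative only and is not part of the disclosed methods; the function name `tandem_layout`, the dictionary `CONSTRUCTS`, and its keys A to D are hypothetical labels chosen for this example. It assembles each layout from an ordered list of peptides, reiterates the amino-terminal peptide at the carboxy terminus, and counts the distinct antigens across the four constructs.

```python
# Illustrative sketch; all names here are hypothetical, not from the specification.

def tandem_layout(peptides):
    """Return the tandem layout with the first peptide reiterated at the C-terminus."""
    if len(set(peptides)) != len(peptides):
        raise ValueError("peptides must be different from one another")
    if len(peptides) < 6:
        raise ValueError("at least six different peptides are required")
    return "-".join(list(peptides) + [peptides[0]])

# Peptide orders as recited in the summary; the keys A-D are arbitrary labels.
CONSTRUCTS = {
    "A": ["M24", "M5", "M6", "M19", "M29", "M14"],
    "B": ["M2", "M43", "M94", "M22", "M11", "M59", "M33"],
    "C": ["M89", "M101", "M77", "M114", "M75", "M76", "M92"],
    "D": ["M1.0", "M12", "Spa", "M28", "M3", "M1.2", "M18"],
}

layouts = {name: tandem_layout(order) for name, order in CONSTRUCTS.items()}
distinct_antigens = {p for order in CONSTRUCTS.values() for p in order}

if __name__ == "__main__":
    for name, layout in layouts.items():
        print(name, layout)
    print(len(distinct_antigens), "distinct antigens")
```

Counting M1.0, M1.2, and Spa as separate antigens, the four orders together give the 27 antigens referred to above.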

In another aspect, the invention provides any of the aforementioned hybrid 
polypeptides or fusion proteins wherein the polypeptides are synthetic. In certain 
embodiments, the synthetic hybrid polypeptides have one or more amino acids altered to a 
corresponding D-amino acid or are linked to an alkane such as acrylamide or an analog or
derivative thereof. In a further embodiment, the synthetic hybrid polypeptides have
immunogenic peptides linked to form a lysine core-branched peptide. 

In still another aspect, the invention includes a nucleic acid molecule
comprising a sequence encoding a hybrid polypeptide of SEQ ID NOS:2, 4, 6, or 8. In a
related embodiment, there is provided a nucleic acid expression construct comprising an
expression control sequence operably linked to a polynucleotide encoding a hybrid
polypeptide of SEQ ID NOS:2, 4, 6, or 8. In other embodiments, the expression construct
comprises a nucleic acid expression vector selected from the group comprising a plasmid,
phagemid, shuttle vector, cosmid, or virus. In one embodiment, the vector is plasmid pT5
(SEQ ID NO:17). In still another embodiment, there is provided a host cell containing any
of the aforementioned nucleic acid constructs. In yet other embodiments, the host cell is
selected from a bacterium, a yeast cell, a nematode cell, an insect cell, or a mammalian
cell. In one embodiment the host cell is the bacterium Escherichia coli.

In a further aspect, the present invention provides a plurality of antibodies,
comprising two or more different antibodies wherein each antibody is specific for a
different immunogenic peptide of a hybrid polypeptide, the polypeptide comprises at least
six different immunogenic peptides linked in tandem, each peptide comprises at least 30
amino acids and the amino-terminal peptide is reiterated as a carboxy-terminal peptide,
wherein the polypeptide is capable of eliciting an immune response against more than one
antigen of group A streptococci comprising at least M5, M6, M14, M19, M24, and M29.
In other embodiments, the plurality of antibodies are specific for more than one antigen of
group A streptococci comprising at least M2, M11, M22, M33, M43, M59, and M94, or at
least M75, M76, M77, M89, M92, M101, and M114, or at least Spa, M1.0, M1.2, M3,
M12, M18, and M28, or specific for 27 or more antigens of group A streptococci. In yet
other embodiments, provided are any of the aforementioned antibodies wherein the hybrid
polypeptides comprise the amino acid sequence of SEQ ID NOS:2, 4, 6, or 8. In other embodiments,
any of the aforementioned antibodies are opsonic and not tissue cross-reactive in a subject.
In one embodiment, the antibodies are polyclonal.

In yet another aspect, there is provided a method of producing a hybrid
polypeptide, comprising culturing a host cell containing any of the aforementioned nucleic
acid expression vectors comprising at least one expression control sequence operably
linked to a nucleic acid molecule encoding a hybrid polypeptide of SEQ ID NOS:2, 4, 6, or
8, under conditions and for a time sufficient for expression of the polypeptide. In a related
aspect, the invention provides any of the aforementioned hybrid polypeptides produced by
the aforementioned method.

In another aspect, there is provided a composition comprising a
pharmaceutically acceptable carrier and any of the aforementioned hybrid polypeptides. In
other aspects, the invention is a cocktail composition comprising a pharmaceutically
acceptable carrier and a mixture of at least two or three of any of the aforementioned
hybrid polypeptides. In one embodiment, the cocktail compositions include at least one of
the hybrid polypeptides having a Spa immunogenic peptide. In other embodiments,
provided are any of the aforementioned compositions wherein the hybrid polypeptides are
mixed in equimolar amounts. In further embodiments, any of the aforementioned
compositions further comprise an adjuvant such as alum or Freund's.

In still another aspect, the invention provides a method for preventing a
microbial infection, comprising administering to a subject any of the aforementioned
compositions at a dose sufficient to elicit antibodies specific for one or more hybrid
polypeptides, wherein the antibodies are opsonic and are not tissue cross-reactive. In certain
embodiments, the microbial infection being prevented is a streptococcal infection and in a
related embodiment is a group A streptococcal infection. In some embodiments, any of the
aforementioned compositions are administered to a subject by a route selected from enteral,
parenteral, transdermal, transmucosal, or inhalation. In further embodiments, the
compositions are administered to a human or animal subject and the compositions further
comprise an adjuvant such as alum or Freund's. In another related aspect, there are provided
isolated antibodies produced by the aforementioned methods for preventing a microbial
infection. In certain embodiments, the antibodies produced by these methods will comprise
at least one antibody specific for an M serotype not represented in a hybrid polypeptide,
such as M4. In still another embodiment, there is provided a method for treating or
preventing a microbial infection comprising administering to a subject a composition
[...] of antibodies. In one embodiment, the subject is an animal or human.

BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 shows a schematic diagram of the four hybrid polypeptides used as
a vaccinating agent. The oligonucleotide primers used to amplify a 5' emm gene fragment
(i.e., immunogenic peptide) by PCR for each serotype were synthesized to include the
indicated unique restriction enzyme sites, which encode amino acids that link the
immunogenic peptides in tandem (indicated by the lines between the boxes). Each box
represents an amino-terminal M protein immunogenic peptide, wherein the number within
each box designates the serotype and the number below each box indicates the number of
amino acids comprising each immunogenic peptide. Serotype M101 was formerly
designated stNS5, serotype M114 was formerly designated st2967, and serotype M94 was
formerly designated M13W and M13. The "6xHis" refers to the presence of a
hexahistidine, which is optional.

Figures 2A to 2C show the nucleotide and amino acid sequence of hybrid
polypeptide Hexavalent A.3. The restriction enzyme sites and the beginning point of each
M serotype immunogenic peptide are indicated. Eleven Arg codons were optimized for
expression in E. coli by mutating AGG/AGA codons to CGT/CGC codons (mutated bases
are underlined).
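The Arg codon substitution described for these figures can be illustrated with a short Python sketch. It is a sketch under stated assumptions rather than the procedure actually used: the text does not say which of AGG or AGA was changed to CGT and which to CGC, so the pairing below (AGA to CGT, AGG to CGC) is an assumption, and the function name `optimize_arg_codons` is hypothetical. Only in-frame codons are replaced, so an AGG or AGA that merely spans two adjacent codons is left untouched.

```python
# Illustrative sketch; the AGA->CGT and AGG->CGC pairing is an assumption.
RARE_ARG = {"AGA": "CGT", "AGG": "CGC"}

def optimize_arg_codons(orf):
    """Replace in-frame AGG/AGA codons; return (new sequence, number changed)."""
    orf = orf.upper()
    if len(orf) % 3 != 0:
        raise ValueError("open reading frame length must be a multiple of 3")
    # Split the reading frame into codons so that only in-frame triplets are tested.
    codons = [orf[i:i + 3] for i in range(0, len(orf), 3)]
    changed = sum(1 for codon in codons if codon in RARE_ARG)
    optimized = "".join(RARE_ARG.get(codon, codon) for codon in codons)
    return optimized, changed

if __name__ == "__main__":
    # Toy reading frame (not from the sequence listing): ATG AGA GTC AGG TAA
    print(optimize_arg_codons("ATGAGAGTCAGGTAA"))
```

Both replacement codons still encode arginine, so the amino acid sequence of the hybrid polypeptide is unchanged by the substitution.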

Figures 3A to 3D show the nucleotide and amino acid sequence of hybrid
polypeptide Hexavalent B.3a. The restriction enzyme sites and the beginning point of each
M serotype immunogenic peptide are indicated. Nine Arg codons were optimized for
expression in E. coli by mutating AGG/AGA codons to CGT/CGC codons (mutated bases
are underlined).

Figures 4A to 4C show the nucleotide and amino acid sequence of hybrid
polypeptide Hexavalent C.2. The restriction enzyme sites and the beginning point of each
M serotype immunogenic peptide are indicated. Seven Arg codons were optimized for
expression in E. coli by mutating AGG/AGA codons to CGT codons (mutated bases are
underlined).

Figures 5A to 5C show the nucleotide and amino acid sequence of hybrid
polypeptide Hexavalent D.3. The restriction enzyme sites and the beginning point of each
M serotype immunogenic peptide are indicated. Five Arg codons were optimized for
expression in E. coli by mutating AGG/AGA codons to CGT/CGC codons (mutated bases
are underlined).

Figure 6 shows a summary of the 28 most common invasive group A
streptococci M serotypes isolated in the U.S. between August 12, 2000 and July 16, 2001.
The data were part of ongoing studies conducted by the Active Bacterial Core Surveillance
Program of the Centers for Disease Control and Prevention. These 28 serotypes accounted
for 92.1% of the 3,424 invasive isolates submitted during this period.

Figure 7 shows that type-specific M protein and Spa antibodies were elicited
by a vaccinating agent comprising a cocktail of four different hybrid polypeptides (i.e., a
27-valent vaccine) when examined by ELISA. Each bar represents serum from one rabbit.

Figure 8 shows the results of in vitro opsonization assays using immune
sera, which were elicited in rabbits with a composition comprising a cocktail of four
different hybrid polypeptides (i.e., a 27-valent vaccine). A positive response was
considered at least a 3-fold increase over opsonization with pre-immune sera (i.e., 30% or
greater opsonization). The pre-immune sera resulted in less than 10% opsonization of each
streptococcal serotype. Each bar represents serum from one rabbit.

Figure 9 shows the results of bactericidal activity assays using immune sera,
which were elicited in rabbits with a vaccinating agent comprising a cocktail of four
different hybrid polypeptides (i.e., a 27-valent vaccine). A positive response was
considered at least 50% killing. Each bar represents serum from one rabbit.
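The positive-response criteria stated for Figures 8 and 9 amount to simple cutoffs, which the following Python sketch applies to per-serotype results. The two cutoff values are taken from the figure descriptions; the function name `positive_serotypes` and the example percentages are hypothetical and are not data from the figures.

```python
# Illustrative sketch; cutoffs come from the figure descriptions, the data are invented.
OPSONIZATION_CUTOFF = 30.0  # percent opsonization counted as a positive response
KILLING_CUTOFF = 50.0       # percent killing counted as a positive response

def positive_serotypes(results, cutoff):
    """Return the serotypes whose value meets or exceeds the cutoff, sorted by name."""
    return sorted(s for s, pct in results.items() if pct >= cutoff)

# Hypothetical percent-killing values for one immune serum.
example_killing = {"M5": 82.0, "M6": 47.5, "M24": 50.0}

if __name__ == "__main__":
    print(positive_serotypes(example_killing, KILLING_CUTOFF))
```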

Figure 10 shows that an immune response with bactericidal activity is
elicited against group A streptococcal serotype 4, which is a serotype not represented in the
27-valent vaccinating agent. Each bar represents serum from one rabbit. Immune sera
were from a rabbit immunized at 0, 4, and 8 weeks (striped bar) and from a rabbit
immunized at 0, 4, and 16 weeks (open bar). The serotype 4 streptococcal isolates were
[...] (CA); Connecticut (CT); and South Dakota (SD).

Figure 11 shows the geometric mean antibody titers before (Day 0) and after
(Day 134) immunization of human subjects, as determined by ELISA. The graph is in a
log10 scale. Serotype M101 was formerly designated stNS5, serotype st2967 is now
designated M114, and serotype M13 is now designated M94.

Figure 12 shows the natural log fold-increase in ELISA-determined
antibody titer (bars) as compared to percent increase in Group A streptococcal killing by
polymorphonuclear cells in 26 serotypes. Serotype M101 was formerly designated stNS5,
serotype M114 was formerly designated st2967, and serotype M13 is now designated M94.

Figure 13 shows the fold-increase in geometric mean antibody titers in
human subjects immunized with a hexavalent polypeptide (Hexa 1.2) as compared to the
geometric mean antibody titers in human subjects immunized with a 27-valent cocktail of
four hybrid multivalent polypeptides (Hexa A.3, Septa B.3a, Septa C.2, and Septa D.3; see
Figure 1). Geometric mean antibody titers were calculated based on ELISAs against the
six serotypes present in both vaccines.
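For readers unfamiliar with the statistic, the following Python sketch shows how a geometric mean titer and a fold-increase between two sets of titers are conventionally computed. It is a generic illustration, not the analysis used to prepare the figures; the function names `geometric_mean` and `fold_increase` are hypothetical, and any titers passed to them in an example are invented.

```python
# Illustrative sketch of the conventional calculation; not the authors' analysis code.
import math

def geometric_mean(titers):
    """Geometric mean of positive titers, computed as exp(mean(log(titer)))."""
    if not titers or any(t <= 0 for t in titers):
        raise ValueError("titers must be a non-empty list of positive numbers")
    return math.exp(sum(math.log(t) for t in titers) / len(titers))

def fold_increase(pre, post):
    """Ratio of the post-immunization to the pre-immunization geometric mean titer."""
    return geometric_mean(post) / geometric_mean(pre)

if __name__ == "__main__":
    # Invented titers for two subjects before and after immunization.
    print(fold_increase([100, 100], [400, 1600]))
```

Averaging on the log scale keeps a single very high titer from dominating the summary, which is why titers are usually reported as geometric rather than arithmetic means.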

DETAILED DESCRIPTION OF THE INVENTION

As noted above, the present invention is generally directed to hybrid
polypeptides of streptococcal antigens and compositions thereof, which are capable of
eliciting protective antibodies against streptococci. Furthermore, the compositions may
include a single hybrid polypeptide or a combination of several different hybrid
polypeptides, which may be useful to elicit an immune response against group A
streptococci. In one aspect, one or more hybrid polypeptides may be formulated as a
composition for simultaneously treating or preventing an infection by several different
group A streptococcal serotypes, as well as treating or preventing infection by serotypes
not represented in the hybrid polypeptide(s). The present invention also provides isolated
nucleic acids that encode such hybrid polypeptides, as well as methods of expressing and
[...] several different linked immunogenic peptides wherein each peptide comprises, for
example, an amino-terminal portion of a streptococcal M protein and can elicit opsonic
antibodies specific for a particular group A streptococcal serotype without eliciting
antibodies that are cross-reactive with host tissue.

The present invention also provides antibodies specific for each
immunogenic peptide serotype included in a hybrid polypeptide or in a combination of
hybrid polypeptides, as well as antibodies specific for serotypes not included in the hybrid
polypeptide(s) (i.e., antibodies that confer cross-protection between serotypes). The
invention, therefore, relates generally to the surprising discovery that highly complex,
multivalent hybrid polypeptide-based vaccines are feasible to simultaneously elicit broadly
protective antibodies against several different streptococcal serotypes. Moreover, a
composition comprising a cocktail of more than one multivalent hybrid polypeptide used as
a vaccine unexpectedly elicits a much more robust antibody response in humans than does
a composition comprising a single multivalent hybrid polypeptide, wherein the increase in
antibody response also results in an increase in antibody function (i.e., increased ability to
opsonize and/or to kill microorganisms). As used herein, a "cocktail" of hybrid
polypeptides refers to a composition comprising at least two different hybrid polypeptides
of this invention. Hence, the invention also relates generally to the surprising discovery
that a cocktail of multivalent hybrid polypeptides can function synergistically to elicit a
greater antibody response that is a protective immune response. Accordingly, the
compositions and methods of the present invention may be readily used to treat or prevent
streptococcal infections. Discussed in more detail below are hybrid polypeptides and
assorted compositions thereof (including admixtures or cocktails) suitable for use within
the present invention, as well as exemplary methods of making such hybrid polypeptides
and compositions, and therapeutic uses thereof.

Hybrid Polypeptides 

The present invention is directed generally to multivalent immunogenic
hybrid polypeptides of M proteins and M-like proteins, including fusions to other proteins.
The immunogenic M and M-like peptides may comprise any portion of an M protein that is
immunogenic, which may or may not confer serotype specificity. A plurality of different
multivalent immunogenic hybrid polypeptides can be mixed or combined into a cocktail
composition for use in eliciting a protective immune response. The present invention
further provides methods for producing synthetic or recombinant multivalent immunogenic
hybrid M polypeptides, including fusion proteins. For example, host cells containing
hybrid polypeptide-encoding nucleic acid expression constructs may be cultured to produce
recombinant hybrid polypeptides. Also contemplated are methods for treating or
preventing a microbial infection or eliciting an immune response using a hybrid
polypeptide or a combination of hybrid polypeptides (including fusion proteins).

By way of background and not wishing to be bound by theory, streptococcal
species are Gram-positive bacteria that are grouped based on immunological differences in
their cell wall polysaccharides and are designated, for example, group A, B, C, F, and G.
Specifically, group A streptococci (GrAS) are clinically important microorganisms that
colonize the throat and skin. GrAS are responsible for a variety of suppurative infections
(e.g., strep throat, necrotizing fasciitis) and non-suppurative sequelae (e.g., acute rheumatic
fever, reactive arthritis) (see generally Cunningham, Clin. Microbiol. Rev. 13:470, 2000).
GrAS have two major, surface-exposed anti-phagocytic factors, capsule and M protein,
which allow these organisms to colonize and survive in a host. The M protein, which is
encoded by the emm gene, extends from the cell surface as an α-helical coiled-coil dimer
that appears as a fibril on the surface of GrAS. The M proteins are a diverse group, which
have been serologically separated into M protein serotypes.

Currently, more than 120 M protein serotypes have been identified, and
within some serotypes several subtypes have been identified. For example, without
limitation, type 1 M protein has related subtypes 1.1-1.15, and type 12 has subtypes
12.1-12.9. In addition, as is known in the art, unclassified serotypes may have an initial
designation (such as st2967) and ultimately receive a "final" classification (such as st2967
now being classified as M type 114), or previously classified serotypes may be reclassified
[...] embodiments, amino-terminal portions of any one or more known M proteins from
serotypes 1-120+, or from unknown serotypes, can be used to generate immunogenic
peptides for inclusion in multivalent immunogenic hybrid polypeptides of the instant
invention. Any numerical ranges recited herein are to be understood to include any integer
within the range and, where applicable (e.g., concentrations), fractions thereof, such as one
tenth and one hundredth of an integer (unless otherwise indicated).

Furthermore, the M protein is part of a superfamily of proteins, which
includes, without limitation, immunoglobulin binding proteins (e.g., FcrA), M-like proteins
(e.g., Mrp and Spa), and M proteins. Accordingly, as used herein, "M protein" refers to the
superfamily of proteins, which includes any known or unknown M protein (with or without
a designated serotype), as well as M-like proteins, such as Spa (see, e.g., Dale et al., J.
Clin. Invest. 103:1261, 1999 and McLellan et al., Infect. Immun. 69:2943, 2001), Mrp (see,
e.g., Boyle et al., J. Infect. Dis. 177:991, 1998), immunoglobulin binding proteins (see,
e.g., Podbielski et al., Mol. Microbiol. 12:725, 1994; Whatmore and Kehoe, Mol.
Microbiol. 11:363, 1994), and the like. As described herein and as is understood in the art,
the serotype of an M protein may be reclassified and result in a change to a different type, a
new type, or even a subtype. Also, as used herein, "M protein serotype" or "M type" refers
to all M proteins within that serotype, including all subtypes, related proteins, and the like.
Furthermore, reference to a particular M type may be applied interchangeably as follows,
by way of illustration and not limitation: serotype 1 M protein, type 1, M1, and the like,
which, as set forth above, includes all subtypes as well.

As described above, group A streptococci have developed a system for
avoiding some of the antimicrobial defenses of a human host. Strains of streptococci that
express capsule and M protein can evade phagocytosis by, for example,
polymorphonuclear leukocytes or neutrophils and multiply in a host that has not been
exposed to streptococci (i.e., a host having non-immune blood). Yet, a subject may develop
resistance to a streptococcal infection if the host can produce protective antibodies directed
against streptococci. Protection against GrAS generally correlates with the presence of
opsonizing antibody against type-specific M protein (see Lancefield, J. Immunol.
89:307, 1962), and the development of secretory or mucosal antibodies is suspected of
playing an important role in preventing initial colonization by streptococci. However, the
pathogenesis of some of the non-suppurative sequelae may be due to host tissue
cross-reaction with streptococcal antibodies. As used herein, "tissue cross-reactivity" means a
host antigen shares at least one epitope with a foreign antigen, such as a streptococcal
antigen. For example, the different antigens may share an identical amino acid sequence,
may be homologous but non-identical, or may be dissimilar molecules with a shared
epitope (e.g., protein and carbohydrate or protein and nucleic acid molecule). Thus, an
effective vaccine against streptococcal infections would preferably elicit an immune
response that includes antibodies that are not tissue cross-reactive (i.e., do not cause
autoimmune disease) and are protective (i.e., opsonic) against many of the clinically
important serotypes.

The present invention pertains to hybrid polypeptides or variants thereof
having a plurality of immunogenic peptides from M or M-like proteins, or nucleic acid
molecules encoding such polypeptides or variants thereof. As used herein, "immunogenic
peptide" refers to any streptococcal peptide or polypeptide having at least one epitope
capable of eliciting an immune response; such peptides are the component units of the hybrid
polypeptides. Preferably, the epitope is within an amino-terminal portion of a streptococcal
M protein or Spa protein and more preferably is an opsonic epitope that does not elicit
tissue cross-reactive antibodies.

The present invention also provides a rational vaccine design approach for
selection of the streptococcal M protein serotypes to be included in the multivalent hybrid
polypeptide vaccine. Some criteria that may be used, without limitation, include
identifying the M protein serotypes that are frequent causes of uncomplicated pharyngitis,
identifying the serotypes that are commonly recovered from normally sterile sites (i.e.,
invasive strains) (e.g., useful data may be obtained from the Active Bacterial Core
Surveillance of the Emerging Infections Program Network, supported by the Centers for
Disease Control and Prevention; see also Beall et al., J. Clin. Microbiol. 35:1231, 1997;
[...] considered currently or historically "rheumatogenic" (Bisno AL. The concept of
rheumatogenic and nonrheumatogenic group A streptococci. In: Read SE, and J. B.
Zabriskie, eds. Streptococcal Diseases and the Immune Response. New York: Academic
Press, 1980:789-803). Also useful is the amino-terminal peptide fragment of Spa, a new
protective antigen that is expressed by several serotypes of group A streptococci (Dale et
al., J. Clin. Invest. 103:1261, 1999); any other streptococcal antigen capable of eliciting
protective antibodies may also be used.

As described herein and known in the art, M protein amino acid sequences
selected for inclusion in a hybrid polypeptide are available at the CDC emm typing center
website (www.cdc.gov/ncidod/biotech/strep/emmtypes.html). In addition, to eliminate the
possibility of eliciting tissue cross-reactive antibodies, the amino-terminal regions of a
mature M protein and Spa may be compared to known human proteins to detect any
homology (e.g., using BlastP). Preferably, amino-terminal M protein portions having five
or more contiguous amino acid matches with a human protein are excluded. In addition,
the selected regions of the M peptides and Spa are preferably analyzed by the method of
Hopp and Woods (Mol. Immunol. 20:483, 1983) to ensure the integrity of hydrophilic
peaks. For example, the Hopp and Woods method may be helpful in predicting that a
particular immunogenic peptide from an M protein amino-terminus may best be fused with
a certain subset of other immunogenic peptides and not with others.
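The exclusion criterion of five or more contiguous matching amino acids can be illustrated with a short Python sketch. This is a simplified stand-in and not the BlastP comparison named above: it looks only for exact shared windows of five residues, and the function names `shared_kmers` and `passes_homology_screen`, as well as any sequences supplied to them, are hypothetical.

```python
# Illustrative sketch: an exact-match window screen, not a BlastP search.

def shared_kmers(peptide, human_protein, k=5):
    """Return the k-residue windows of `peptide` found verbatim in `human_protein`."""
    peptide, human_protein = peptide.upper(), human_protein.upper()
    human_windows = {human_protein[i:i + k]
                     for i in range(len(human_protein) - k + 1)}
    peptide_windows = {peptide[i:i + k] for i in range(len(peptide) - k + 1)}
    return sorted(peptide_windows & human_windows)

def passes_homology_screen(peptide, human_proteins, k=5):
    """True if the peptide shares no k contiguous residues with any human protein."""
    return not any(shared_kmers(peptide, h, k) for h in human_proteins)

if __name__ == "__main__":
    # Invented sequences used only to exercise the functions.
    print(shared_kmers("AVTRGTINDPQ", "MKLTRGTIQW"))
    print(passes_homology_screen("AVTRGTINDPQ", ["MKLTRGAIQW"]))
```

Any run of more than five matching residues necessarily contains a five-residue window, so screening at k=5 also catches the longer matches.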

In preferred embodiments, the hybrid polypeptides or variants, and
combinations thereof, may be designed to be vaccines specific for streptococci associated
with, for example, pharyngitis, scarlet fever, acute rheumatic fever, necrotizing fasciitis,
cellulitis, meningitis, pneumonia, toxic shock syndrome, bacteremia, septicemia, septic
arthritis, pyoderma, skin infections, impetigo, erysipelas, soft-tissue infection, nephritis,
pyogenic reactions, and the like. Additionally, the vaccines may be designed to treat or
prevent streptococcal infections in particular populations (e.g., immunocompromised
patients, children, and the elderly), particular geographic populations (e.g., Australian
aborigines), and particular geographic locations (e.g., temperate regions or Scandinavian
countries). Preferably, the amino-terminal portions of the different M proteins that
comprise an immunogenic peptide can elicit opsonic antibodies that do not cross-react with
host tissue.

In another preferred aspect, the M protein serotype is selected based on its
prevalence among the most common streptococci known to currently be associated with a
particular disease (e.g., serotypes associated with pharyngitis or skin infections) or
sequelae. In a further preferred aspect, the instant invention may be used to design and
generate a variety of different multivalent immunogenic hybrid polypeptides that may be
directed to, or be admixed in particular groupings to address, shifts in prevalent
streptococci. For example, as is known in the art, the most prevalent streptococcal serotype
associated with a disease today, such as ARF, may not be as prevalent or important in 5, 10,
or 15 years. In another example, a particular streptococcal serotype may be prevalent in
Europe and suddenly become important in South America. Hence the immunogenic
peptides that comprise a multivalent hybrid polypeptide may be changed, or different
multivalent hybrid polypeptides may be mixed and matched to create a desired cocktail,
designed for use in a particular population or location to attack the most clinically relevant
streptococci as needed.

In certain preferred embodiments, an immunogenic peptide comprises an 
amino-terminal portion of a M protein or Spa pmtein having 10-70 amino acids, or any 
integer in that range; more preferably having 20-65 amino acids, or any integer in that 
range; and most preferably 30-60 amino acids, or any integer in that range. In particularly 
preferred embodiments, a hybrid polypeptide comprises at least six or seven different 
linked immunogenic peptides, wherein each peptide comprises an amino-terminal portion 
of a streptococcal M protein of at least 30 amino acids, and wherein the polypeptide has an 
amino-terminal peptide that is reiterated as a carboxy-terminal peptide and the polypeptide 
is capable of eliciting an immune response against more than one serotype of group A 
streptococci. Preferably, a hybrid polypeptide is capable of eliciting an immune response 
against at least serotypes 5, 6, 14, 19, 24, and 29; or at least against serotypes 2, 11, 22, 33,
43, 59, and 94; or at least against serotypes 75, 76, 77, 89, 92, 101, and 114; or at least
against serotypes 1.0, 1.2, 3, 12, 18, and 28. In another preferred embodiment, a hybrid
polypeptide is capable of eliciting an immune response against a M serotype not 
represented in the hybrid polypeptide, such as serotype 4. 
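The mix-and-match idea described above can be sketched in a few lines of Python. This is only an illustration: the four serotype groups are taken from the text, but the labels A through D and the function name are hypothetical, and the sketch ignores any cross-protective response against serotypes that are not represented in a polypeptide.

```python
# Serotype groups as listed in the text; the labels "A".."D" are hypothetical.
COVERAGE = {
    "A": {"5", "6", "14", "19", "24", "29"},
    "B": {"2", "11", "22", "33", "43", "59", "94"},
    "C": {"75", "76", "77", "89", "92", "101", "114"},
    "D": {"1.0", "1.2", "3", "12", "18", "28"},
}


def select_cocktail(prevalent):
    """Return (polypeptide labels to mix, prevalent serotypes left uncovered)."""
    prevalent = set(prevalent)
    # Keep every polypeptide that represents at least one prevalent serotype.
    chosen = sorted(label for label, sero in COVERAGE.items() if sero & prevalent)
    covered = set().union(*(COVERAGE[label] for label in chosen))
    return chosen, sorted(prevalent - covered)
```

For instance, a population in which serotypes 1.0, 12, 89, and 4 are prevalent would call for the polypeptides labeled C and D here, with serotype 4 not directly represented.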



In certain embodiments, the hybrid polypeptides have at least 50% to 100%
amino acid identity, or any integer in that range, to the amino acid sequence as set forth in 
SEQ ID NOS:2, 4, 6, and 8; preferably 60%-99% identity or any integer in that range, more
preferably 70%-97% identity or any integer in that range, and most preferably 80%-95%
identity or any integer in that range. As used herein, "percent identity" or "% identity" is
the percentage value returned by comparing the whole of the subject polypeptide, peptide, 
or variant thereof sequence to a test sequence using a computer implemented algorithm, 
typically with default parameters. Sequence comparisons can be performed using any 
standard software program, such as BLAST, tBLAST, pBLAST, or MegAlign. Still others
include those provided in the Lasergene bioinformatics computing suite, which is produced
by DNASTAR® (Madison, Wisconsin). References for algorithms such as ALIGN or
BLAST may be found in, for example, Altschul, J. Mol. Biol. 219:555-565, 1991; or
Henikoff and Henikoff, Proc. Natl. Acad. Sci. USA 89:10915-10919, 1992. BLAST is
available at the NCBI website (www.ncbi.nlm.nih.gov/BLAST). Other methods for
comparing multiple nucleotide or amino acid sequences by determining optimal alignment
are well known to those of skill in the art (see, e.g., Peruski and Peruski, The Internet and 
the New Biology: Tools for Genomic and Molecular Research (ASM Press, Inc. 1997); Wu
et al. (eds.), "Information Superhighway and Computer Databases of Nucleic Acids and
Proteins," in Methods in Gene Biotechnology, pages 123-151 (CRC Press, Inc. 1997); and
Bishop (ed.), Guide to Human Genome Computing, 2nd Edition, Academic Press, Inc.,
1998). As used herein, "similarity" between two peptides or polypeptides is generally
determined by comparing the amino acid sequence of one peptide or polypeptide to the 
amino acid sequence and conserved amino acid substitutes thereto of a second peptide or 
polypeptide. Fragments or portions of the hybrid polypeptides of the present invention
may be employed for producing the corresponding full-length hybrid polypeptide by
peptide synthesis; therefore, the fragments may be employed as intermediates for
producing the full-length hybrid polypeptides. Similarly, fragments or portions of the
nucleic acids of the present invention may be used to synthesize full-length nucleic acids of 
the present invention.

The hybrid polypeptides and corresponding nucleic acids of the present
invention are preferably provided in an isolated form, and in certain preferred 
embodiments, are purified to homogeneity. As used herein, the term "isolated" means that
the material is removed from its original or natural environment. For example, a naturally
occurring nucleic acid molecule or polypeptide present in a living animal or cell is not
isolated, but the same nucleic acid molecule or polypeptide is isolated when separated from 
some or all of the co-existing materials in the natural system. The nucleic acid molecules, 
for example, could be part of a vector and/or such nucleic acids or polypeptides could be 
part of a composition and still be isolated in that such vector or composition is not part of
its natural environment.
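As a rough illustration of the "percent identity" value defined earlier, the following sketch counts identical residues over two sequences that have already been aligned to the same length, with "-" marking a gap. It is a simplification and not the BLAST or MegAlign algorithm: the function name, the gap convention, and the choice to count every aligned column in the denominator are assumptions made for the example.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of alignment columns holding the same residue in both sequences."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be pre-aligned to equal length")
    # Ignore columns that are gaps in both sequences.
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if not (x == "-" and y == "-")]
    if not columns:
        raise ValueError("no aligned columns to compare")
    # A column matches only if both residues are identical and not a gap.
    matches = sum(1 for x, y in columns if x == y and x != "-")
    return 100.0 * matches / len(columns)
```

A variant that differs from a ten-residue test sequence at one position would score 90% under this convention, which falls within the 80%-95% range recited above.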

The present invention also pertains to hybrid polypeptides and variants 
thereof produced synthetically or recombinantly, and preferably recombinantly. The 
immunogenic peptide components of the hybrid polypeptides may be synthesized by 
standard chemical methods, including synthesis by automated procedure. In general,
immunogenic peptides are synthesized based on the standard solid-phase Fmoc protection
strategy with HATU as the coupling agent. The immunogenic peptide is cleaved from the
solid-phase resin with trifluoroacetic acid containing appropriate scavengers, which also
deprotects side chain functional groups. Crude immunogenic peptide is further purified 
using preparative reversed-phase chromatography. Other purification methods, such as
partition chromatography, gel filtration, gel electrophoresis, or ion-exchange
chromatography may be used. Other synthesis techniques known in the art may be
employed to produce similar immunogenic peptides, such as the tBoc protection strategy, 
use of different coupling reagents, and the like. In addition, any naturally occurring amino 
acid or derivative thereof may be used, including D- or L-amino acids and combinations
thereof. In particularly preferred embodiments, a synthetic hybrid polypeptide of the
invention will have a M2, M12, M24, or M89 immunogenic peptide at the amino-terminus.

As described herein, the hybrid polypeptides of the invention may be
recombinant, wherein the hybrid polypeptide is expressed from a polynucleotide encoding 
a desired hybrid polypeptide that is operably linked to an expression control sequence (e.g.,
promoter) in a nucleic acid expression construct. In particularly preferred embodiments, a
recombinant hybrid polypeptide of the invention will have a M2, M12, M24, or M89
immunogenic peptide at the amino-terminus. Some preferable recombinant hybrid
polypeptides comprise immunogenic peptides linked in tandem having the structure of
M24-M5-M6-M19-M29-M14-M24, M2-M43-M94-M22-M11-M59-M33-M2, M89-M101-
M77-M114-M75-M76-M92-M89, or M1.0-M12-Spa-M28-M3-M1.2-M18-M1.0, and any
combination thereof. Most preferably, a hybrid polypeptide comprises the amino acid
sequence of SEQ ID NOS:2, 4, 6, or 8, and variants thereof. As set forth above and herein,
any M serotype may be included in the present invention; preferably serotypes 1.0, 1.2, 2,
3, 5, 6, 11, 12, 14, 18, 19, 22, 24, 28, 29, 33, 43, 59, 75, 76, 77, 89, 92, 94, 101, and 114 are
included in the hybrid polypeptides. In certain preferred embodiments as described herein,
hybrid polypeptides of the subject invention are capable of eliciting at least one opsonic
antibody that is not a tissue cross-reactive antibody in a subject, wherein the subject is an 
animal or a human. 
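The tandem arrangement recited above, in which the amino-terminal peptide is reiterated as the carboxy-terminal peptide, can be expressed as a small sketch. The function names are hypothetical and the code manipulates only peptide labels, not actual amino acid sequences.

```python
def tandem_order(peptides):
    """Return the peptide order with the first peptide reiterated at the end."""
    if len(set(peptides)) != len(peptides):
        raise ValueError("each immunogenic peptide should be listed once")
    return list(peptides) + [peptides[0]]


def is_reiterated(order):
    """True if the first label recurs only as the final label."""
    interior = order[:-1]
    return (len(order) >= 2
            and order[0] == order[-1]
            and len(set(interior)) == len(interior))
```

Joining the result of `tandem_order(["M24", "M5", "M6", "M19", "M29", "M14"])` with hyphens reproduces the first structure listed above.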

In another preferred embodiment, the hybrid polypeptides are linked by at
least two amino acids encoded by a nucleic acid sequence that is a restriction enzyme
recognition site, wherein the restriction sites may be any one or more of BamHI, ClaI,
EcoRI, HindIII, KpnI, NcoI, NheI, PmlI, PstI, SalI, XhoI and the like. Additional amino
acid linkers may also be added synthetically as described herein. Preferably, the additional
amino acids do not create any identity in sequence within a five amino acid stretch of a
human protein. In addition, the hybrid polypeptides of the subject invention may further
comprise at least one additional carboxy-terminal amino acid, wherein the additional amino
acid is a D- or an L-amino acid. Any of the twenty naturally occurring amino acids or
derivatives thereof may be added, such as cysteine, histidine, leucine, and glutamic acid.
For example, the addition of cysteine may be useful to attach other constituents, such as a
lipid, a carrier protein, and the like. 
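To illustrate how a six-base restriction enzyme recognition site can encode a two amino acid linker, the following sketch translates several well-known recognition sequences using the standard genetic code. It assumes, for illustration only, that the site is read in frame starting at its first base; in an actual construct the reading frame depends on the flanking sequence, and the function name is hypothetical.

```python
# Standard genetic code, indexed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Recognition sequences (5' to 3') of some of the enzymes named in the text.
SITES = {
    "BamHI": "GGATCC",
    "ClaI": "ATCGAT",
    "EcoRI": "GAATTC",
    "HindIII": "AAGCTT",
    "KpnI": "GGTACC",
    "NcoI": "CCATGG",
    "NheI": "GCTAGC",
    "PstI": "CTGCAG",
    "SalI": "GTCGAC",
    "XhoI": "CTCGAG",
}


def linker_residues(site_name):
    """Translate a recognition site, read in frame from its first base."""
    dna = SITES[site_name]
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
```

Under this in-frame assumption a BamHI site would contribute Gly-Ser and an EcoRI site Glu-Phe between adjacent peptides.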

As described herein, the invention also provides hybrid polypeptide fusion
proteins comprising hybrid polypeptides fused to an additional functional or non-functional
polypeptide sequence that permits, for example by way of illustration and not limitation,
detection, isolation and/or purification of the hybrid polypeptide fusion proteins. For
instance, an additional functional polypeptide sequence may be a tag sequence, which 
includes fusion proteins that may in certain embodiments be detected, isolated and/or 
purified by protein-protein affinity (e.g., receptor-ligand), metal affinity or charge affinity
methods. In certain other embodiments the hybrid polypeptide fusion proteins may be
detected by specific protease cleavage of a fusion protein having a sequence that comprises 
a protease recognition sequence, such that the hybrid polypeptides may be separable from 
the additional polypeptide sequence. In addition, the hybrid polypeptides may be made 
synthetically including additional amino acids, a carrier protein, or a tag sequence, which
may be located at either the amino- or carboxy-terminal end. In particularly preferred
embodiments, for example, recombinant hybrid polypeptides are fused in-frame to a
carboxy-terminal tag, which tag may be any one of alkaline phosphatase, β-galactosidase,
hexahistidine (6xHis), FLAG® epitope tag (DYKDDDDK, SEQ ID NO: 18), or GST, and 
the like. Most preferred are hybrid polypeptide fusion proteins that facilitate affinity
detection and isolation of the hybrid polypeptides and may include, for example, poly-His
or the defined antigenic peptide epitopes described in U.S. Patent No. 5,011,912 and in
Hopp et al. (1988 Bio/Technology 6:1204), or the XPRESS™ epitope tag (DLYDDDDK,
SEQ ID NO: 19; Invitrogen, Carlsbad, CA). The affinity sequence may be a hexa-histidine
tag as supplied by a vector. For example, a pBAD/His (Invitrogen) or a pQE-30 (Qiagen,
Valencia, CA) vector can provide a polyhistidine tag for purification of the mature protein
fusion from a particular host, such as a bacterium. Alternatively, the affinity sequence may
be added either synthetically or engineered into the primers used to recombinantly generate
the nucleic acid sequence (e.g., using the polymerase chain reaction) encoding an
immunogenic peptide of the multivalent hybrid polypeptide. Preferably, a multivalent
hybrid polypeptide is fused to a polyhistidine and is encoded by a recombinant nucleic acid
sequence encoding such a fusion protein. 

The immunogenic peptides may also be chemically linked to form the
desired hybrid polypeptides, including additional amino acid sequences, by a variety of
methods, as provided herein and known in the art (see, generally, Jackson et al., Vaccine
18:355, 2000). Recombinant or synthetic peptides may be linked to form linear (see, e.g.,
Leclerc et al., Eur. J. Immunol. 17:26, 1987 and Francis et al., Nature 330:168, 1987) or
branched (see, e.g., Fitzmaurice et al., Vaccine 14:553, 1996) constructs, or may be linked
using chemical ligation of epitopes (see, e.g., Tam and Spetzler, Biomed. Pept. Proteins
Nucleic Acids 1:123, 1995; Rose, J. Am. Chem. Soc. 116:30, 1994; and Dawson et al.,
Science 266:776, 1994). Peptides may also be linked via the multiple antigen peptide
system to form branched hybrid polypeptides (see, e.g., Tam, Proc. Natl. Acad. Sci. USA
85:5409, 1988; U.S. Patent No. 5,229,490). The multiple antigen peptide system makes
use of multifunctional core molecules where each functional group on the core molecule
forms at least two branches, the principal units of which are also multifunctional. For
example, lysine may be used as the core peptide because it has one carboxyl functional
group and two (α and ε) amine functional groups. Each multifunctional unit in a branch
provides a base for added growth, resulting in exponential growth of the dendritic polymer.
Peptides may then be joined to the dendritic core using a linking molecule (e.g., glycine).
The multiple antigen peptide system links a large number of synthetic peptides to the
functional group of a dendritic core molecule providing a high concentration of synthetic
peptides in a low molecular volume. Preferably, either two or three levels of geometrically
branched lysines are used, wherein these lysine cores form a tetrameric and octameric core
structure, respectively. The multiple antigen peptide system may also include a lipophilic
anchoring moiety attached to the core molecule, thereby eliminating the need for an
adjuvant formulated in a peptide vaccine otherwise requiring one for immunostimulation
(see, e.g., U.S. Patent No. 5,580,563). In one preferred embodiment, a hybrid polypeptide
of the invention comprises immunogenic peptides linked to form a lysine core-branched 
polypeptide. 
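The geometric growth of a lysine core can be checked with a one-line calculation. Because each lysine offers two amine groups (α and ε), every added level doubles the number of attachment points. The function name below is hypothetical.

```python
def map_attachment_points(levels):
    """Number of peptide attachment points on a lysine core with `levels` branching levels."""
    if levels < 1:
        raise ValueError("at least one level of branching is required")
    # Each lysine contributes two amines, so each level doubles the branches.
    return 2 ** levels
```

Two levels give four attachment points (a tetrameric core) and three levels give eight (an octameric core), consistent with the description above.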

Additionally, similar or different synthetic peptides may be linked by
controlled polymerization through derivatization of the amino-terminus of a peptide with
the acryloyl (CH2=CH-) group using acryloyl chloride (see, e.g., O'Brien-Simpson et al.,
J. Am. Chem. Soc. 119:1183, 1997; Jackson et al., Vaccine 15:1697, 1997). The
derivatized peptides are then polymerized singly, or in admixture with similarly derivatized
peptides, by free radical initiation of chain elongation. As a result, peptides are assembled
into polymers in which the peptide determinants form side chains pendant from an alkane 
backbone. The hybrid polypeptides and fusion proteins as described herein may be 
constructed as set forth above. In one preferred embodiment, a hybrid polypeptide of the 
invention comprises immunogenic peptides linked by an alkane backbone. In certain
embodiments, the alkane backbone is acrylamide or an analog or derivative thereof. 

Therapeutic Formulations and Methods of Use 

The invention also relates to pharmaceutical compositions that contain one 
or more hybrid multivalent polypeptides, which may be used to elicit an immune response.
The invention further relates to methods for treating and preventing microbial infections by
administering to a subject a hybrid polypeptide or a mixture of hybrid polypeptides at a
dose sufficient to elicit antibodies specific for one or more hybrid polypeptides, as described
herein. A hybrid polypeptide or a cocktail of hybrid polypeptides is preferably part of a 
pharmaceutical composition when used in the methods of the present invention. 

In certain aspects, the invention provides a composition comprising a
pharmaceutically acceptable carrier, diluent, or excipient, and any of the multivalent hybrid
polypeptides of the subject invention and any combination thereof. A preferred
embodiment is a pharmaceutically acceptable carrier and a mixture of at least two or three
hybrid polypeptides of the subject invention. In yet another preferred embodiment, a
hybrid polypeptide that comprises at least seven different immunogenic peptides linked in
tandem, wherein each peptide comprises an amino-terminal portion of a streptococcal M 
protein of at least 50 amino acids, and wherein the polypeptide has an amino-terminal 
peptide that is reiterated as a carboxy-terminal peptide and the polypeptide is capable of 
eliciting an immune response against more than one serotype of group A streptococci
comprising at least serotypes 1.0, 1.2, 3, 12, 18, and 28, is combined with at least one other
hybrid polypeptide of the subject invention and a pharmaceutically acceptable carrier. In a
more preferred embodiment, a composition comprises a pharmaceutically acceptable
carrier and a mixture of the hybrid polypeptides of SEQ ID NOS:2, 4, 6, and 8 or variants
thereof, and in other embodiments these four polypeptides or variants thereof are mixed in
equimolar amounts and at least one of the hybrid polypeptides has a Spa peptide. In other
embodiments, SEQ ID NOS:2, 4, 6, and 8 or variants thereof are provided with a
polyhistidine tag or further comprise at least one additional amino acid, such as cysteine.
Each of these formulations may further comprise an adjuvant, such as alum or Freund's,
and a diluent such as water or PBS. Further, therapeutic compositions of the present
invention should preferably be stable for several months and capable of being produced

The pharmaceutical composition will include at least one of a
pharmaceutically acceptable vehicle, carrier, diluent, or excipient, in addition to one or
more hybrid multivalent polypeptides or fusion proteins thereof and, optionally, other
components. For example, pharmaceutically acceptable carriers suitable for use with a
composition of a hybrid polypeptide or fusion protein thereof, or cocktail of two or more
hybrid polypeptides or fusion proteins thereof, may include, for example, a thickening agent,
a buffering agent, a solvent, a humectant, a preservative, a chelating agent, an adjuvant, and
the like, and combinations thereof. Exemplary adjuvants are alum (aluminum hydroxide,
REHYDRAGEL®), aluminum phosphate, proteosome adjuvant (see, e.g., U.S. Patent Nos.
5,726,292 and 5,985,284), virosomes, liposomes with and without Lipid A, Detox
(Ribi/Corixa), MF59, or other oil and water emulsion type adjuvants, such as
nanoemulsions (see, e.g., U.S. Patent No. 5,716,637) and submicron emulsions (see, e.g.,
U.S. Patent No. 5,961,970), and Freund's complete and incomplete. Pharmaceutically
acceptable carriers for therapeutic use are well known in the pharmaceutical art, and as
described herein and, for example, in Remington's Pharmaceutical Sciences, Mack
Publishing Co. (A.R. Gennaro, ed., 18th Edition, 1990) and in CRC Handbook of Food,
Drug, and Cosmetic Excipients, CRC Press LLC (S.C. Smolinski, ed., 1992).

"Pharmaceutically acceptable salt" refers to salts of the compounds of the 
present invention derived from the combination of such compounds and an organic or
inorganic acid (acid addition salts) or an organic or inorganic base (base addition salts).
The compounds of the present invention may be used in either the free base or salt forms,
with both forms being considered as being within the scope of the present invention. 

In addition, the pharmaceutical composition may further include a diluent
such as water or phosphate buffered saline (PBS). Preferably, the diluent is PBS with a final
phosphate concentration ranging from about 0.1 mM to about 1 M, more preferably from
about 0.5 mM to about 500 mM, even more preferably from about 1 mM to about 50 mM,
and most preferably from about 2.5 mM to about 10 mM; and the final salt concentration
ranges from about 100 mM to about 200 mM and most preferably from about 125 mM to
about 175 mM. Preferably, the final PBS concentration is about 5 mM phosphate and
about 150 mM salt (such as NaCl). In certain embodiments, any of the aforementioned
pharmaceutical compositions comprising a cocktail of multivalent hybrid polypeptides of 
the instant invention are preferably sterile. 
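The most preferred diluent ranges recited above can be checked, and the salt concentration converted to a mass concentration, with the short sketch below. The function names are hypothetical, the check uses the stated bounds without the ±10% allowance implied by "about", and the conversion assumes the salt is NaCl (molar mass 58.44 g/mol).

```python
NACL_G_PER_MOL = 58.44  # molar mass of sodium chloride


def in_most_preferred_range(phosphate_mm, salt_mm):
    """True if both values fall in the most preferred ranges (2.5-10 mM, 125-175 mM)."""
    return 2.5 <= phosphate_mm <= 10 and 125 <= salt_mm <= 175


def nacl_mg_per_ml(salt_mm):
    """Convert a NaCl concentration in mM to mg/mL."""
    # mM multiplied by g/mol gives mg/L; divide by 1000 for mg/mL.
    return salt_mm * NACL_G_PER_MOL / 1000
```

The preferred formulation of about 5 mM phosphate and about 150 mM salt passes the check, and 150 mM NaCl corresponds to roughly 8.8 mg/mL.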

The compositions can be sterile either by preparing them under an aseptic
environment and/or they can be terminally sterilized using methods available in the art.
Many pharmaceuticals are manufactured to be sterile and this criterion is defined by the
USP XXII <1211>. Sterilization in this embodiment may be accomplished by a number of
means accepted in the industry and listed in the USP XXII <1211>, including gas
sterilization, ionizing radiation or filtration. Sterilization may be maintained by what is
termed aseptic processing, defined also in USP XXII <1211>. Acceptable gases used for
gas sterilization include ethylene oxide. Acceptable radiation types used for ionizing
radiation methods include gamma, for instance from a cobalt 60 source and electron beam.
A typical dose of gamma radiation is 2.5 MRad. When appropriate, filtration may be
accomplished using a filter with suitable pore size, for example 0.22 µm and of a suitable
material, for instance Teflon®. The term "USP" refers to U.S. Pharmacopeia (see
www.usp.org; Rockville, MD).

The present invention also pertains to methods for preventing a microbial 
infection, comprising administering to a subject a composition of the subject invention at a
dose sufficient to elicit antibodies specific for one or more hybrid polypeptides, wherein the
antibodies are preferably opsonic and are not tissue cross-reactive. In certain embodiments
an infection is a streptococcal infection, such as a group A streptococcal infection. A
subject suitable for treatment with a hybrid polypeptide formulation may be identified by
well-established indicators of risk for developing a disease or well-established hallmarks of
an existing disease. For example, indicators of an infection include fever, pus,
microorganism positive cultures, inflammation, and the like. Infections that may be treated
with a hybrid polypeptide of the subject invention include, without limitation, those caused
by or due to microorganisms, whether the infection is primary, secondary, opportunistic, or
the like. Examples of microorganisms include Gram-positive bacteria, such as
streptococci.

The pharmaceutical compositions that contain one or more hybrid
polypeptides may be in any form that allows for the composition to be administered to a
subject, such as a human or animal. For example, multivalent hybrid polypeptide
compositions of the present invention may be prepared and administered as a liquid
solution or prepared as a solid form (e.g., lyophilized), which may be administered in solid
form, or resuspended in a solution in conjunction with administration. The hybrid
polypeptide composition is formulated so as to allow the active ingredients contained
therein to be bioavailable upon administration of the composition to a subject or patient or
bioavailable via slow release. Compositions that will be administered to a subject or
patient take the form of one or more dosage units, where for example, a tablet may be a
single dosage unit, and a container of one or more compounds of the invention in aerosol
form may hold a plurality of dosage units. In certain preferred embodiments, any of the 
aforementioned pharmaceutical compositions comprising a hybrid polypeptide or cocktail 
of hybrid polypeptides of the invention are in a container, preferably in a sterile container. 

In one embodiment, the therapeutic composition is administered orally, and
a hybrid polypeptide or cocktail composition of the invention is taken up by cells, such as
cells located in the lumen of the gut. Other typical routes of administration include,
without limitation, enteral, parenteral, transdermal/transmucosal, and inhalation. The term
enteral, as used herein, is a route of administration in which the agent is absorbed through
the gastrointestinal tract or oral mucosa, including oral, rectal, and sublingual. The term
parenteral, as used herein, describes administration routes that bypass the gastrointestinal
tract, including intraarterial, intradermal, intramuscular, intranasal, intraocular,
intraperitoneal, intravenous, subcutaneous, submucosal, and intravaginal injection or
infusion techniques. The term transdermal/transmucosal, as used herein, is a route of
administration in which the agent is administered through or by way of the skin, including
topical. The term inhalation encompasses techniques of administration in which an agent is 
introduced into the pulmonary tree, including intrapulmonary or transpulmonary. 
Preferably, the compositions of the present invention are administered intramuscularly. 

Depending upon the application, the dose of hybrid polypeptide in the
compositions will vary, generally, from about 10 µg to about 10 mg, preferably from about
100 µg to 1 mg, more preferably from about 150 µg to 500 µg, and most preferably from
about 200 µg to about 400 µg, in combination with the biologically acceptable excipient,
adjuvant, binder, and/or diluent, including any integer within the dosing range. As used
herein, the term "about" or "consists essentially of" refers to ± 10% of any indicated
structure, value, or range. Booster immunizations may be given at multiple times, at
desired intervals ranging from about 2 weeks to about 24 weeks, preferably 2, 4 and 8 week 
intervals, and more preferably 2, 4, and 16 week intervals, and even more preferably 0, 4, 
and 24 week intervals to maximize the immune response. 
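The nested dose ranges above, together with the ±10% meaning given to "about", can be sketched as a simple lookup. The tier labels and the function name are hypothetical, and applying the 10% allowance uniformly to every bound is an assumption made for the example.

```python
# (label, lower bound in µg, upper bound in µg), narrowest range first.
DOSE_TIERS_UG = [
    ("most preferred", 200, 400),
    ("more preferred", 150, 500),
    ("preferred", 100, 1000),
    ("general", 10, 10000),
]


def dose_tier(dose_ug, tolerance=0.10):
    """Return the narrowest recited range containing the dose, or None."""
    for label, low, high in DOSE_TIERS_UG:
        # "About" is read as +/- 10% of each stated bound.
        if low * (1 - tolerance) <= dose_ug <= high * (1 + tolerance):
            return label
    return None
```

A 300 µg dose falls in the most preferred range under this reading, while an 800 µg dose falls only in the preferred range.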

The invention further provides a plurality of antibodies produced by the
method for preventing a microbial infection that comprises administering to a subject a
composition of the subject invention at a dose sufficient to elicit antibodies specific for one
or more hybrid polypeptides, wherein the antibodies are opsonic and are not tissue cross-
reactive. In one embodiment, the antibodies comprise at least one antibody specific for a
M serotype not represented in a hybrid polypeptide, such as type 4 M protein. In another
embodiment, a method for treating or preventing a microbial infection comprises
administering to a subject a composition comprising a pharmaceutically acceptable carrier
and a plurality of antibodies of the subject invention.

Antibodies and Assays

In another aspect, the hybrid polypeptides and variants thereof of the present 
invention are utilized to elicit antibodies specific for at least one epitope present on the 
hybrid polypeptides provided herein. Accordingly, the present invention also provides 
such antibodies. In preferred embodiments the antibodies bind to specific protective
epitopes present on a M protein. Within the context of the present invention, the term
"antibodies" includes polyclonal antibodies, monospecific antibodies, monoclonal
antibodies, anti-idiotypic antibodies, fragments thereof such as F(ab')2 and Fab fragments,
and recombinantly or synthetically produced antibodies. Such antibodies incorporate the
variable regions that permit a monoclonal antibody to specifically bind, which means an
antibody is able to selectively bind to a peptide produced from an emm or spa sequence of
this invention. "Specific for" refers to the ability of a protein (e.g., an antibody) to
selectively bind a polypeptide or peptide encoded by an emm or spa nucleic acid molecule
or a synthesized hybrid polypeptide of this invention. Association or "binding" of an
antibody to a specific antigen generally involves electrostatic interactions, hydrogen
bonding, Van der Waals interactions, and hydrophobic interactions. Any one of these or
any combination thereof can play a role in the binding between an antibody and its antigen.
Such an antibody generally associates with an antigen, such as M protein, with an affinity
constant (Ka) of at least 10^, preferably at least 10\ more preferably at least 10^ still more
preferably at least 10^ and most preferably at least 10^. Affinity constants may be
determined by one of ordinary skill in the art using well-known techniques (see Scatchard,
Ann. N.Y. Acad. Sci. 51:660-672, 1949). The affinity of a monoclonal antibody or antibody
can be readily determined by one of ordinary skill in the art (see Scatchard, Ann. N.Y.
Acad. Sci. 51:660-672, 1949).
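For illustration only, a Scatchard-type analysis of the kind cited above can be sketched as a straight-line fit. For a single class of binding sites, the ratio of bound to free ligand (B/F) plotted against bound ligand (B) follows B/F = Ka(Bmax - B), so the slope of the line is -Ka. The function name and the use of an ordinary least-squares fit are assumptions of this sketch, not a description of any particular laboratory protocol.

```python
def scatchard_fit(bound, free):
    """Fit B/F against B by least squares; return (Ka, Bmax)."""
    if len(bound) != len(free) or len(bound) < 2:
        raise ValueError("need at least two paired measurements")
    x = list(bound)
    y = [b / f for b, f in zip(bound, free)]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    ka = -slope             # slope of the Scatchard line is -Ka
    bmax = intercept / ka   # x-intercept gives the binding capacity
    return ka, bmax
```

Concentrations are expected in molar units, so that Ka is returned in reciprocal molar units.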

In addition, the term "antibody," as used herein, includes naturally occurring
antibodies as well as non-naturally occurring antibodies, including, for example, single
chain antibodies, chimeric, bifunctional and humanized antibodies, as well as antigen-
binding fragments thereof. Such non-naturally occurring antibodies may be constructed
using solid phase peptide synthesis, may be produced recombinantly, or may be obtained,
for example, by screening combinatorial libraries consisting of variable heavy chains and
variable light chains (Huse et al., Science 246:1275-1281 (1989)). These and other
methods of making, for example, chimeric, humanized, CDR-grafted, single chain, and
bifunctional antibodies are well known in the art (Winter and Harris, Immunol. Today
14:243, 1993; Ward et al., Nature 341:544, 1989; Harlow and Lane, Antibodies: A
Laboratory Manual, Cold Spring Harbor Laboratory, New York, 1992; Borrebaeck,
Antibody Engineering, 2d ed., Oxford Univ. Press, 1995; Hilyard et al., Protein
Engineering: A practical approach, IRL Press, 1992).

In a preferred embodiment, a plurality of antibodies comprises two or more
different antibodies wherein each antibody is specific for a different immunogenic peptide
of a hybrid polypeptide, the polypeptide comprises at least six different immunogenic
peptides linked in tandem, each peptide comprises at least 30 amino acids and the amino-
terminal peptide is reiterated as a carboxy-terminal peptide, wherein the polypeptide is
capable of eliciting an immune response against more than one serotype of group A
streptococci. Preferably, a hybrid polypeptide is capable of eliciting an immune response
against at least serotypes 5, 6, 14, 19, 24, and 29; or at least against serotypes 2, 11, 22, 33,
43, 59, and 94; or at least against serotypes 75, 76, 77, 89, 92, 101, and 114; or at least
against serotypes 1.0, 1.2, 3, 12, 18, and 28. In another preferred embodiment, a hybrid
polypeptide is capable of eliciting an immune response against a M serotype not
represented in the hybrid polypeptide, such as serotype 4. More preferably the antibodies
are elicited by the hybrid polypeptides of SEQ ID NOS:2, 4, 6, and 8, individually or in
combination, and most preferably at least one antibody is opsonic and not tissue cross-
reactive in a subject. As used herein, "opsonic" means any epitope that enhances
phagocytosis of a cell or particle having the epitope. As commonly understood by those
having ordinary skill in the art, "opsonic antibodies" are antibodies that facilitate
phagocytic activity of a particle having the antigen, such as a bacterial cell. In yet another
preferred embodiment, the antibodies elicited by the hybrid polypeptides of the invention
are polyclonal.

Polyclonal antibodies can be readily generated by one of ordinary skill in
the art from a variety of warm-blooded animals such as horses, cows, goats, sheep, dogs, 
chickens, turkeys, rabbits, mice, or rats. Briefly, the desired hybrid polypeptide or 
mixtures of hybrid polypeptides, or variants thereof are administered to immunize an 
animal through parenteral, intraperitoneal, intramuscular, intraocular, or subcutaneous
injections. The immunogenicity of the hybrid polypeptide of interest may be increased 
through the use of an adjuvant, such as alum and Freund's complete or incomplete 
adjuvant. Following several booster immunizations over a period of weeks, small samples 
of serum are collected and tested for reactivity to the desired M peptide or Spa peptide.
Particularly preferred polyclonal immune sera give a signal that is at least three times
greater than background. Once the titer of the animal has reached a plateau in terms of its 
reactivity to the hybrid polypeptide, larger quantities of polyclonal immune sera may be 
readily obtained either by weekly bleedings or by exsanguinating the animal. 

For example, the hybrid polypeptides of SEQ ID NOS:2, 4, 6, and 8 (see
Figure 1) were purified, mixed in equimolar concentrations, and formulated with alum to
contain 400 µg total protein and 750 µg alum in each 0.5 ml dose. Three rabbits each
received three intramuscular injections of the hybrid polypeptide cocktail at either 0, 4, and
8 weeks or 0, 4, and 16 weeks, and immune serum was recovered at 18 weeks. The
harvested sera were analyzed by ELISA, as described herein, using the purified
recombinant dimeric peptide components of each of the larger vaccine polypeptides. Using
ELISA (with purified recombinant dimeric immunogenic peptides of the hybrid molecule),
the immune sera from rabbits immunized at 0, 4, and 16 weeks were found to contain high
titers of antibodies against the vast majority of the M peptides and the Spa peptide
contained in the hybrid polypeptides of the cocktail (Figure 3). In certain preferred
embodiments, the polyclonal antibodies include those that are specific for group A
streptococci of serotypes 5, 6, 14, 19, 24, and 29, or of serotypes 2, 11, 22, 33, 43, 59, and
94, or of serotypes 75, 76, 77, 89, 92, 101, and 114, or of serotypes 1.0, 1.2, 3, 12, 18, and
28.



WO 03/065973                                                    PCT/US02/34769



Monoclonal antibodies may also be readily generated using well-known
techniques (see U.S. Patent Nos. RE 32,011, 4,902,614, 4,543,439, and 4,411,993; see also
Monoclonal Antibodies, Hybridomas: A New Dimension in Biological Analyses, Plenum
Press, Kennett, McKearn, and Bechtol (eds.), 1980, and Antibodies: A Laboratory Manual,
Harlow and Lane (eds.), Cold Spring Harbor Laboratory Press, 1988). Briefly, in one
embodiment, a subject animal such as a rat or mouse is injected with a desired protein or
peptide. If desired, various techniques may be utilized in order to increase the resultant
immune response generated by the protein, in order to develop greater antibody reactivity.
For example, the desired protein or peptide may be coupled to a carrier protein (such
as ovalbumin, keyhole limpet hemocyanin (KLH), or E. coli labile toxin B subunit) or
administered with adjuvants (such as alum or Freund's complete and incomplete adjuvant)
and the like.

Once suitable antibodies have been obtained, they may be isolated or
purified by many techniques well known to those of ordinary skill in the art (see
Antibodies: A Laboratory Manual, Harlow and Lane, supra). Suitable techniques include
peptide or protein affinity columns, HPLC or RP-HPLC, purification on protein A or
protein G columns, or any combination of these techniques. Within the context of the
present invention, the term "isolated" as used to define antibodies means
"substantially free of other blood components."

Several assays are available as described herein to examine the activity of
the antibodies elicited by the hybrid polypeptides of the subject invention. An exemplary
assay is an opsonophagocytosis assay, which detects phagocytosis facilitated by
opsonic antibodies present in test antisera. Briefly, the assay measures the
amount of phagocytosis of selected bacterial cells by neutrophils after preincubating the
cells in the presence or absence of antisera raised against, for example, hybrid polypeptide
immunogens. Preincubation with the immune sera coats the cells with M protein reactive
antibodies, some of which will be opsonic antibodies elicited from opsonic epitopes present
on the M protein antigens. Preincubated, coated cells are then mixed with whole blood
from an animal, typically a mammal for which opsonic protection is to be sought (e.g., a
human) to determine the percentage of neutrophils that associate with the bacterial cells,
which is a measure of phagocytic activity facilitated by opsonic antibodies. Immune sera
containing opsonic antibodies induce a higher percentage of neutrophils associated with the
selected bacteria than do immune sera lacking opsonic antibodies. In a variation of this
test, the bactericidal activity of immune sera may be tested by incubating the immune sera
with fewer bacterial cells, incubating in blood for a longer period of time, and then plating
the mixture on a culture medium to score for viable bacteria. The presence of opsonic
antibodies in the immune sera increases the number of bacteria destroyed by phagocytosis
and, therefore, lowers the number of colony forming units (CFUs) detected on the plate
culture. Another exemplary assay analyzes bactericidal activity of the test antibodies (see
Example 5).

Nucleic Acid Molecules and Host Cells

The invention also encompasses isolated nucleic acid molecules comprising
a sequence encoding a hybrid polypeptide wherein each peptide comprises an
amino-terminal portion of a streptococcal M protein (e.g., SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, or
16). Also provided by the present invention are nucleic acid expression constructs, and
host cells containing such nucleic acids, which encode hybrid polypeptides and variants
thereof, which hybrid polypeptides are capable of eliciting an immune response against
more than one serotype of group A streptococci. This aspect of the invention pertains to
isolated nucleic acid sequences encoding a hybrid polypeptide sequence as described
herein, as well as those sequences readily derived from isolated nucleic acid molecules
such as, for example, complementary sequences, reverse sequences and complements of
reverse sequences.

"Nucleic acid" or "nucleic acid molecule" refers to any of deoxyribonucleic
acid (DNA), ribonucleic acid (RNA), oligonucleotides, fragments generated by the
polymerase chain reaction (PCR), and fragments generated by any of ligation, scission,
endonuclease action, and exonuclease action. Nucleic acids may be composed of
monomers that are naturally occurring nucleotides (such as deoxyribonucleotides and
ribonucleotides), analogs of naturally occurring nucleotides (e.g., α-enantiomeric forms of
naturally-occurring nucleotides), or a combination of both. Modified nucleotides can have
modifications in sugar moieties and/or in pyrimidine or purine base moieties. Sugar
modifications include, for example, replacement of one or more hydroxyl groups with
halogens, alkyl groups, amines, and azido groups, or sugars can be functionalized as ethers
or esters. Moreover, the entire sugar moiety may be replaced with sterically and
electronically similar structures, such as aza-sugars and carbocyclic sugar analogs.
Examples of modifications in a base moiety include alkylated purines and pyrimidines,
acylated purines or pyrimidines, or other well-known heterocyclic substitutes. Nucleic acid
monomers can be linked by phosphodiester bonds or analogs of such linkages. Analogs of
phosphodiester linkages include phosphorothioate, phosphorodithioate,
phosphoroselenoate, phosphorodiselenoate, phosphoroanilothioate, phosphoranilidate,
phosphoramidate, and the like. The term "nucleic acid" also includes so-called "peptide
nucleic acids," which comprise naturally occurring or modified nucleic acid bases attached
to a polyamide backbone. Nucleic acids can be either single stranded or double stranded.

Further, an "isolated nucleic acid molecule" refers to a polynucleotide
molecule in the form of a separate fragment or as a component of a larger nucleic acid
construct, which has been separated from its source cell (including the chromosome it
normally resides in) at least once in a substantially pure form. For example, a DNA
molecule that encodes a Spa polypeptide, peptide, or variant thereof, which has been
separated from a Streptococcus cell or from the genomic DNA of a Streptococcus cell, is
an isolated DNA molecule. Another example of an isolated nucleic acid molecule is a
chemically synthesized nucleic acid molecule. Nucleic acid molecules may be comprised
of a wide variety of nucleotides, including DNA, cDNA, RNA, nucleotide analogues, or
some combination thereof. In one preferred embodiment, an isolated nucleic acid molecule
comprises a sequence encoding a hybrid polypeptide of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14,
or 16.

In certain aspects, the invention relates to nucleic acid vectors and
constructs that include nucleic acid sequences of the present invention, and in particular to
"nucleic acid expression constructs" that include any polynucleotide encoding a hybrid
polypeptide as provided above; to host cells that are genetically engineered with vectors
and/or constructs of the invention; and to their production and use in methods for treating or
preventing a microbial infection or eliciting an immune response. The hybrid polypeptides
may be expressed in mammalian cells, yeast, bacteria or other cells under the control of
appropriate expression control sequences. Cell-free translation systems may also be
employed to produce such proteins using RNAs derived from the nucleic acid expression
constructs of the present invention. Appropriate cloning and expression vectors for use
with prokaryotic and eukaryotic hosts are described, for example, by Sambrook et al.,
Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, NY,
(1989), and may include plasmids, cosmids, shuttle vectors, viral vectors and vectors
comprising a chromosomal origin of replication as disclosed therein. In one preferred
embodiment, a nucleic acid expression construct comprises an expression control sequence
operably linked to a polynucleotide encoding a hybrid polypeptide of SEQ ID NO:2, 4, 6,
8, 10, 12, 14, or 16. In another preferred embodiment, the nucleic acid expression
construct has an inducible promoter, which may be a lac, tac, trc, ara, trp, λ phage, T7
phage, or T5 phage promoter, and more preferably is a T5 phage promoter/lac operator
expression control sequence as set forth in SEQ ID NO:17. The "expression control
sequence" refers to any sequences sufficient to allow expression of a protein of interest in a
host cell, including one or more promoter sequences, enhancer sequences, operator
sequences (e.g., lacO), and the like. In certain embodiments, the hybrid
polypeptide-encoding nucleic acid is in a plasmid, more preferably in plasmid pT5 (SEQ ID NO:17)
and the host cell is a bacterium, most preferably Escherichia coli.

It should be understood that hybrid polypeptide-encoding nucleic acid may
be a variant of the natural sequence due to, for example, the degeneracy of the genetic code
(including alleles). Briefly, such "variants" may result from natural polymorphisms or may
be synthesized by recombinant methodology (e.g., to obtain codon optimization for
expression in a particular host) or chemical synthesis, and may differ from wild-type
polypeptides by one or more amino acid substitutions, insertions, deletions, or the like.
Variants encompassing conservative amino acid substitutions include, for example,
substitutions of one aliphatic amino acid for another, such as Ile, Val, Leu, or Ala or
substitutions of one polar residue for another, such as between Lys and Arg, Glu and Asp,
or Gln and Asn. Such substitutions are well known in the art to provide variants having
similar physical properties and functional activities, such as for example, the ability to
elicit and cross-react with similar antibodies. Other variants include nucleic acid
sequences that encode a hybrid polypeptide having at least 50%, 60%, 70%, 80%, 90% or
95% amino acid identity to SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, or 16. Preferred
embodiments are those having greater than 90% or 95% identity with the amino acid
sequence of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, or 16. As will be appreciated by those of
ordinary skill in the art, a nucleotide sequence encoding a hybrid polypeptide or variant
thereof may differ from the native sequences presented herein due to codon degeneracy,
nucleotide polymorphism, or nucleotide substitution, deletion or insertion. Thus, in certain
aspects the present invention includes all degenerate nucleic acid molecules that encode
polypeptides and peptides comprising the amino acid sequence of SEQ ID NOS:2, 4, 6, 8,
10, 12, 14, or 16. In another aspect, included are nucleic acid molecules that encode hybrid
polypeptide variants having conservative amino acid substitutions or deletions or
substitutions such that the hybrid polypeptide variant retains at least one epitope capable of
eliciting antibodies specific for one or more streptococcal serotypes.
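
The identity thresholds and conservative substitution classes described above can be illustrated with a minimal Python sketch. It assumes two pre-aligned sequences of equal length; the function names, the grouping table, and the example peptides are hypothetical and are not taken from SEQ ID NOS:1-16. A real comparison would first align the sequences with a dedicated alignment tool.

```python
# Conservative classes named in the text: aliphatic Ile/Val/Leu/Ala and the
# polar pairs Lys/Arg, Glu/Asp, Gln/Asn (one-letter codes). Illustrative only.
CONSERVATIVE_GROUPS = [set("IVLA"), set("KR"), set("ED"), set("QN")]


def percent_identity(seq_a, seq_b):
    """Percent of aligned positions that carry the same residue."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned, non-empty, equal length")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


def is_conservative(res_a, res_b):
    """True if two different residues belong to the same class above."""
    return res_a != res_b and any(
        res_a in group and res_b in group for group in CONSERVATIVE_GROUPS
    )


# Hypothetical 10-residue peptides differing by V->I and R->K.
reference = "AVLKEQNDRI"
variant = "AILKEQNDKI"
identity = percent_identity(reference, variant)
substitutions = [(a, b) for a, b in zip(reference, variant) if a != b]
all_conservative = all(is_conservative(a, b) for a, b in substitutions)
```

In this toy case the variant differs at two of ten positions, and both differences fall within the conservative classes listed in the text.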

In certain aspects, a nucleic acid sequence may be modified to encode a
hybrid polypeptide variant wherein specific codons of the nucleic acid sequence have been
changed to codons that are favored by a particular host and can result in enhanced levels of
expression (see, e.g., Haas et al., Curr. Biol. 5:315, 1996; Yang et al., Nucleic Acids Res.
24:4592, 1996). For example, certain codons of the immunogenic peptides obtained from
streptococcal M proteins were optimized, without changing the primary sequence of the
peptides, for improved expression in Escherichia coli. By way of illustration and not
limitation, eleven of the thirteen arginine (Arg) codons of AGG/AGA in the hexavalent
hybrid polypeptide as set forth in SEQ ID NO:9 were changed to the Arg codons of
CGT/CGC as set forth in SEQ ID NO:1. Similarly, twelve of twenty AGG/AGA Arg
codons of SEQ ID NO:15 were optimized to CGT/CGC codons as set forth in
SEQ ID NO:8; seven of thirteen AGG/AGA Arg codons of SEQ ID NO:11 were optimized
to CGT/CGC codons as set forth in SEQ ID NO:3; and five of nine AGG/AGA Arg codons
of SEQ ID NO:13 were optimized to CGT/CGC codons as set forth in SEQ ID NO:5. As
is known in the art, codons may be optimized for whichever host the hybrid polypeptide is
to be expressed in, including without limitation bacteria, fungi, insect cells, plant cells, and
mammalian cells. Additionally, codons encoding different amino acids may be changed as
well, wherein one or more codons encoding different amino acids may be altered
simultaneously as would best suit a particular host (e.g., codons for arginine, glycine,
leucine, and serine may all be optimized or any combination thereof). Alternatively, codon
optimization may result in one or more changes in the primary amino acid sequence, such
as a conservative amino acid substitution, addition, deletion, or combination thereof.
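
The in-frame replacement of AGG/AGA arginine codons with CGT/CGC described above might be sketched as follows. This is an illustration under stated assumptions, not the procedure used to derive the disclosed sequences: the one-to-one mapping (AGG to CGT, AGA to CGC) is assumed, every rare codon is replaced rather than a selected subset, and the input sequence is hypothetical. The translation check confirms that the primary amino acid sequence is unchanged.

```python
from itertools import product

# Standard genetic code, built in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumed mapping for this sketch; the text only states AGG/AGA -> CGT/CGC.
RARE_ARG_TO_PREFERRED = {"AGG": "CGT", "AGA": "CGC"}


def split_codons(coding_seq):
    """Split an in-frame coding sequence into a list of codons."""
    seq = coding_seq.upper()
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of three")
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


def translate(coding_seq):
    """Translate codon by codon; '*' marks a stop codon."""
    return "".join(CODON_TABLE[codon] for codon in split_codons(coding_seq))


def optimize_arg_codons(coding_seq):
    """Replace rare Arg codons in frame, leaving all other codons untouched."""
    return "".join(
        RARE_ARG_TO_PREFERRED.get(codon, codon) for codon in split_codons(coding_seq)
    )


original = "ATGAGGAAAAGACGTTAA"  # hypothetical: Met-Arg-Lys-Arg-Arg-stop
optimized = optimize_arg_codons(original)
same_protein = translate(optimized) == translate(original)
```

Working codon by codon matters here: a plain text search for "AGG" would also hit out-of-frame occurrences (for example inside GAG-GAG) and alter the encoded protein.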

While particular embodiments of isolated nucleic acids encoding hybrid
polypeptides are depicted in SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, and 15, within the context of
the present invention, reference to one or more isolated nucleic acids includes variants of
these sequences that are substantially similar in that they encode native or non-native
hybrid polypeptides with similar structure and ability to elicit serospecific antibodies to at
least one immunogenic peptide subunit contained in the hybrid polypeptides of SEQ ID
NOS:2, 4, 6, 8, 10, 12, 14, or 16. As used herein, the nucleotide sequence is deemed to be
"substantially similar" if: (a) the nucleotide sequence is derived from the coding region of
an emm gene isolated from a streptococcus (including, for example, portions of the sequence
or allelic variations of the sequences discussed above) and contains an M protein epitope
with substantially the same ability to elicit opsonic antibodies protective against
streptococci that are not tissue cross-reactive; (b) the nucleotide sequence is capable of
hybridization to the nucleotide sequences of the present invention under moderate or high
stringency; (c) the nucleotide sequences are degenerate sequences which code for the
same amino acids using different codon sequence(s), as a result of the degeneracy of the
genetic code, relative to the
nucleotide sequences defined in (a) or (b); or (d) is a complement of any of the sequences
described in (a), (b) or (c).

As used herein, two nucleotide sequences are said to "hybridize" under
conditions of a specified stringency when stable hybrids are formed between substantially
complementary nucleic acid sequences. Stringency of hybridization refers to a description
of the environment under which hybrids are annealed and washed, which typically includes
ionic strength and temperature. Other factors that might affect hybridization include the
probe size and the length of time the hybrids are allowed to form. For example, "high,"
"medium" and "low" stringency encompass the following conditions or equivalent
conditions thereto: high stringency is 0.1 x SSPE or SSC, 0.1% SDS, 65°C; medium
stringency is 0.2 x SSPE or SSC, 0.1% SDS, 50°C; and low stringency is 1.0 x SSPE or
SSC, 0.1% SDS, 50°C. As used herein, the term "high stringency conditions" means that
one or more sequences will remain hybridized only if there is at least 95%, and preferably
at least 97%, identity between the sequences. In preferred embodiments, the nucleic acid
sequences that remain hybridized to a hybrid polypeptide-encoding nucleic acid molecule
encode polypeptides that retain at least one epitope of a hybrid polypeptide encoded by a
nucleic acid of any one of SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, and 15.

Methods for producing the hybrid polypeptides of the subject invention are
provided as well, wherein any of the nucleic acid molecules and host cells described herein
may be used. In a preferred embodiment, a method of producing a hybrid polypeptide
comprises culturing a host cell containing a nucleic acid expression vector comprising at
least one expression control sequence operably linked to a nucleic acid molecule encoding
a hybrid polypeptide, such as a hybrid polypeptide as set forth in any one of
SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, or 16, under conditions and for a time sufficient for
expression of the polypeptide. In one particularly preferred embodiment, a hybrid
polypeptide is produced by this method, and more preferably the hybrid polypeptides
produced are of SEQ ID NOS:10, 12, 14, or 16, and most preferably the hybrid
polypeptides produced are of SEQ ID NOS:2, 4, 6, or 8.

The following examples are offered by way of illustration and not by way of
limitation.

EXAMPLES

EXAMPLE 1

Generation of Recombinant Multivalent and Individual Streptococcal proteins 

Once the specific 5' sequences of each emm and spa gene had been selected
for inclusion in the vaccine, they were used to design four hybrid nucleic acid molecules,
each containing 6-7 emm and/or spa coding sequences linked in tandem by unique
restriction enzyme recognition sites (Figures 1-5). The four hybrid nucleic acid molecules
were constructed using PCR-generated emm or spa nucleic acid molecules that were
amplified from streptococcal genomic DNA of the corresponding serotype using
oligonucleotide forward and reverse primers containing restriction enzyme sites at the 5'
end. The PCR-generated fragments were purified, digested with the appropriate restriction
enzymes, ligated using methods previously described (Dale et al., J. Immunol. 151:2188,
1993; Dale, Vaccine 17:193, 1999), and then sequentially cloned into the expression vector
pT5. Plasmid pT5 (SEQ ID NO:17) comprises a bacteriophage T5 promoter operably linked to
lac operators, which means expression from the T5 promoter can be induced with
isopropyl-beta-D-thiogalactopyranoside (IPTG).

The emm fragment was reiterated at the 3' end of each hybrid nucleic
acid molecule based on observations that an amino-terminal M protein peptide reiterated on
the carboxy-terminus of a hybrid polypeptide appears to enhance or protect the
immunogenicity of the adjacent M protein peptides (see Dale, Vaccine 17:193, 1999;
WO 99/13084). In addition, the emm nucleic acid molecule cloned at the 3'-end of the
hybrid molecule was engineered to include at the 5' end the following: (a) at least six
histidine codons, (b) a XhoI restriction enzyme site, (c) optionally one or more amino acid
codons (e.g., cysteine), and (d) at least one stop codon (TAA or TAG). Several of the Arg
codons from each of the hexavalent and septavalent polypeptides were optimized (see
Figures 2-5), without changing the primary amino acid sequence, for expression in E. coli,
which yielded from about a 2-fold to about a 10-fold increase in polypeptide production.
The nucleic acid and amino acid sequences of the hybrid polypeptides used to make
admixtures for immunizing rabbits and/or humans are set forth in SEQ ID NOS:1-16. Each
expression plasmid construct of pT5 was used to transform E. coli strain JM105 (genotype
F' traD36 lacIq Δ(lacZ)M15 proA+B+/thi rpsL (StrR) endA sbcB15 sbcC hsdR4(rK- mK+)
Δ(lac-proAB)). The sequence identity of each hybrid DNA molecule transformed into
JM105 E. coli was verified by sequencing both strands. Expression of each component
fusion protein was detected by SDS-PAGE analysis using whole cell lysates before and
after 1 mM IPTG induction.

In addition to the hybrid multivalent polypeptides generated to immunize a
particular subject, recombinant homodimeric peptides comprising a single serotype
immunogenic peptide were expressed and purified to test immune sera elicited by the
hybrid streptococcal polypeptides. Each emm or spa gene fragment, included in the hybrid
nucleic acid molecules described above, was independently amplified by PCR, purified,
and cloned sequentially into the expression vector pT5 as an in-frame dimer with a
restriction enzyme site between each coding sequence. Each PCR-generated sequence was
verified by sequencing both strands of the dimer-encoding nucleic acid molecule.
Expression of each peptide in transformed JM105 E. coli was detected by SDS-PAGE
analysis, as described above.

Each hybrid polypeptide encoded by the hybrid nucleic acid molecules was
further analyzed by BlastP (matrix BLOSUM62) to assure that there were no significant
homologies with human proteins in the GenBank database. The linking restriction enzyme
sites between the emm and/or spa encoding nucleic acid molecules were selected to avoid
creating a sequence encoding potential human tissue cross-reactive epitopes. Specifically,
the two amino acid residues encoded by each restriction enzyme site along with the six
flanking M protein or Spa residues on each side of the sites (14 residues in total) were
searched using BlastP (matrix PAM30) to detect potential homologies with human proteins
in the GenBank database. Any restriction enzyme site matching more than four contiguous
amino acids of a human protein sequence was not used.
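
The junction screen described above (two linker residues plus six flanking residues on each side, with a site rejected when more than four contiguous residues match a human protein) could be prototyped as below. This is only a sketch: exact substring matching stands in for the BlastP (PAM30) search, and the peptide, linker, and "human" sequences are hypothetical placeholders rather than disclosed sequences.

```python
def junction_window(left_peptide, linker, right_peptide, flank=6):
    """Linker residues plus `flank` residues from each neighbouring peptide."""
    return left_peptide[-flank:] + linker + right_peptide[:flank]


def longest_shared_run(window, protein):
    """Length of the longest contiguous stretch of `window` found in `protein`."""
    best = 0
    for start in range(len(window)):
        # Only test stretches longer than the best one found so far.
        for end in range(start + best + 1, len(window) + 1):
            if window[start:end] in protein:
                best = end - start
            else:
                break  # a longer stretch from this start cannot match either
    return best


def junction_is_acceptable(window, human_proteins, max_run=4):
    """Reject a junction sharing more than `max_run` contiguous residues."""
    return all(longest_shared_run(window, p) <= max_run for p in human_proteins)


# Hypothetical inputs: two peptides joined by a Gly-Ser linker (as a BamHI
# site, GGATCC, would encode) and two made-up "human" sequences.
window = junction_window("MKTAYIAKQR", "GS", "AVTRGTINDP")
run_hit = longest_shared_run(window, "LLPQRGSAVKE")
run_clean = longest_shared_run(window, "MEEPQSDPSV")
accepted = junction_is_acceptable(window, ["MEEPQSDPSV"])
rejected = not junction_is_acceptable(window, ["LLPQRGSAVKE", "MEEPQSDPSV"])
```

A production screen would query a protein database with an alignment tool; the exact-match loop here only illustrates the 14-residue window and the four-residue cutoff.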
EXAMPLE 2

Purification of Multivalent and Individual Streptococcal proteins 

Each hybrid polypeptide and individual dimeric peptide was purified
separately. Cell paste of E. coli JM105 expressing His-tagged hybrid polypeptide was
lysed in phosphate buffered saline (PBS) by microfluidization (Microfluidics, Inc., Newton,
MA). After centrifugation, the clarified lysate was batch adsorbed to nickel-loaded affinity
chelate resin (Tosoh Biosep, Montgomeryville, PA), washed and eluted with a step
gradient of imidazole in PBS. Fractions containing the eluted hybrid polypeptide were
pooled, pumped through a 5 ml HITRAP® Q anion exchange cartridge (Amersham
Pharmacia Biotech, Piscataway, NJ) and concentrated in a stir cell (Millipore, Bedford,
MA). The concentrated hybrid polypeptide was further purified by size exclusion
chromatography using a Superdex 200 column (600 cm length, Amersham Pharmacia
Biotech) equilibrated with PBS. Fraction purity was monitored using SDS-PAGE and
fractions containing pure hybrid polypeptide were pooled and stored at -20°C until use.
Purity, identity and concentration of the hybrid polypeptide were further assessed by
reverse-phase HPLC, amino acid analysis, and electrospray mass spectrometry.

Individual dimeric peptides were similarly purified, but with notable
differences. E. coli JM105 expressing 6xHis-tagged dimeric peptide were lysed in PBS
containing 8M urea for 3 hours with stirring. After centrifugation, the clarified lysate was
batch adsorbed to nickel-loaded affinity chelate resin (Tosoh Biosep) and washed and
eluted with a step gradient of imidazole in PBS. The eluted dimeric peptide was then
loaded onto a preparative reverse-phase C4 column (Vydac, Hesperia, CA), washed, and
eluted with increasing concentrations of acetonitrile in water containing 0.1%
trifluoroacetic acid. Fractions of eluted dimeric peptide were monitored using SDS-PAGE.
Fractions containing purified dimeric peptide were pooled and dialyzed against PBS before
storage at -20°C.

EXAMPLE 3

Formulation of Hybrid Polypeptide Cocktail and Immunization of Rabbits 

The four multivalent polypeptides (Hexa A.1 [SEQ ID NO:10], Septa B.2
[SEQ ID NO:16], Septa C.2 [SEQ ID NO:4], and Septa D.1 [SEQ ID NO:14]) were mixed
in equimolar amounts and adsorbed to alum (REHYDRAGEL®, low viscosity, Reheis,
Inc., Berkeley Heights, NJ) to achieve a final protein concentration of 800 µg/ml and a
final alum concentration of 1.5 mg/ml. This cocktail of immunogenic polypeptides
represents at least 27 antigens.

New Zealand white rabbits were each immunized with 400 µg (i.e., about
100 µg of each multivalent polypeptide) of the vaccine via the intramuscular route at either
0, 4, and 8 weeks or 0, 4, and 16 weeks (see Dale, Vaccine 17:193, 1999). Serum was
obtained prior to the first injection and two weeks after the final injection.

EXAMPLE 4 

ELISA Using Serum from Immunized Rabbits

Type-specific antibodies were detected by enzyme-linked immunosorbent
assays (ELISAs) essentially by methods previously described (McLellan et al., Infect.
Immun. 69:2943, 2001). Briefly, microtiter wells were coated with purified recombinant
dimeric M peptides (i.e., copying the vaccine subunits and used as solid-phase antigens).
Wells without peptide but containing all other reagents served as negative controls. The
ELISAs were performed using pre-immune and immune rabbit sera. The sera were serially
diluted in PBS (pH 7.4) with 0.05% TWEEN® 20, added to the wells, and incubated at
37°C for 2 hours. The wells were washed with 0.15% saline-TWEEN® 20. A horseradish
peroxidase-conjugated goat immunoglobulin G (IgG) to rabbit immunoglobulins (IgG,
IgA, and IgM) (ICN Biomedicals, Aurora, OH) diluted 1:2,000 was added and incubated at
37°C for 2 hours. The wells were then washed, 5-aminosalicylic acid was added, and the
A450 was recorded after 15 minutes in an MR 600 microplate reader (Dynatech
Laboratories, Inc., Chantilly, VA).

The immune sera from rabbits (obtained 2 weeks after the final
immunization) contained high titers of antibodies to the majority of the M peptides
contained in the vaccine (Figure 7). Antibody titers were determined for each of the 26 M
peptides and Spa, a new protective antigen of group A streptococci (Dale et al., J. Clin.
Invest. 130:1261, 1999). All pre-immune titers were less than 200. Out of the 81 immune
serum titers determined (27 antigens x 3 rabbits), 69 titers (85%) increased by four-fold or
greater over the pre-immune levels (Figure 7).
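
The four-fold criterion and the reported proportion (69 of 81 titers) can be checked with a short Python sketch. The function name and the three example titer pairs are hypothetical; only the counts 69 and 81 and the four-fold cutoff come from the text.

```python
def count_fold_rises(pre_titers, immune_titers, fold=4.0):
    """Count paired titers whose immune value is at least `fold` x pre-immune."""
    if not pre_titers or len(pre_titers) != len(immune_titers):
        raise ValueError("titer lists must be non-empty and of equal length")
    responders = sum(
        1 for pre, post in zip(pre_titers, immune_titers) if post >= fold * pre
    )
    total = len(pre_titers)
    return responders, total, 100.0 * responders / total


# Hypothetical titers for one antigen in three rabbits (pre-immune below 200).
pre = [100, 100, 100]
immune = [400, 200, 12800]
responders, total, percent = count_fold_rises(pre, immune)

# Proportion reported in the text: 69 of 81 titers (27 antigens x 3 rabbits).
reported_percent = round(100 * 69 / 81)
```

With the counts given in the text, 69/81 rounds to the reported 85%.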

EXAMPLE 5

Opsonization Assays Using Serum from Immunized Rabbits 

Opsonic antibodies were detected by in vitro opsonization assays,
essentially as previously described (Beachey et al., J. Exp. Med. 145:1469, 1977). The test
mixture consisted of 0.05 ml of a standard suspension of streptococci grown to mid-log
phase, 0.05 ml test serum, and 0.2 ml whole, heparinized (10 U/ml) nonimmune human
blood. For these assays, the number of streptococcal CFU per leukocyte was
approximately 10. The tubes were rotated end-over-end for 45 minutes at 37°C. Smears
were then made on glass slides and stained with Wright's stain (Sigma Diagnostics, St.
Louis, MO). Opsonization was quantitated by counting 50 consecutive neutrophils and
calculating the percentage with associated streptococci (percent opsonization).
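
Percent opsonization as defined above is a simple proportion of the 50 neutrophils counted. The Python sketch below shows the calculation together with the 30% positive threshold used in this example; the function names and the neutrophil counts are hypothetical.

```python
def percent_opsonization(neutrophils_with_streptococci, neutrophils_counted=50):
    """Percentage of counted neutrophils that have associated streptococci."""
    if neutrophils_counted <= 0:
        raise ValueError("at least one neutrophil must be counted")
    if not 0 <= neutrophils_with_streptococci <= neutrophils_counted:
        raise ValueError("associated count must lie between 0 and the total")
    return 100.0 * neutrophils_with_streptococci / neutrophils_counted


def is_positive(percent, threshold=30.0):
    """Apply the positive threshold used in this example (30% opsonization)."""
    return percent >= threshold


# Hypothetical smear counts out of 50 consecutive neutrophils.
pre_immune = percent_opsonization(4)
immune = percent_opsonization(21)
```

In this hypothetical case the pre-immune serum gives 8% and the immune serum 42%, so only the immune serum would be scored as opsonic.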

The pre-immune sera from all three rabbits resulted in <10% opsonization
of each of the 26 serotypes tested (data not shown), indicating that the donor blood used for
these assays did not contain antibodies against the test organism and that each organism
was fully resistant to opsonization in nonimmune blood. Using 30% opsonization in the
presence of immune serum as a positive threshold result (i.e., three or more times the
pre-immune level), 18 of the 26 serotypes (69%) were opsonized by at least one of three
immune rabbit sera (Figure 8).

EXAMPLE 6

Bactericidal Assays Using Serum from Immunized Rabbits 

Bactericidal assays were performed similar to Example 5 (Lancefield, J.
Exp. Med. 106:525, 1957) except that 0.05 ml of Todd-Hewitt broth containing fewer
bacteria was added to 0.1 ml of test serum and 0.35 ml of blood and the mixture was
rotated for three hours at 37°C. Then 0.1 ml aliquots of this mixture were added to melted
sheep's blood agar, pour plates were prepared, and viable organisms (CFU) were counted
after overnight incubation at 37°C. For each serotype tested, three different inocula were
used to assure that the growth in blood containing pre-immune serum was optimal and was
quantifiable. The results are expressed as percent killing, which was calculated using the
following formula: [(CFU after three hours growth with pre-immune serum) - (CFU after
three hours growth with immune serum)] ÷ [CFU after three hours growth with pre-immune
serum] x 100. Only those assays that resulted in growth of the test strain to at
least eight generations in the presence of pre-immune serum were used to express percent
killing in the presence of immune serum.
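
The percent killing formula and the eight-generation acceptance criterion can be expressed as a short Python sketch. It assumes that "generations" means doublings of the inoculum (so eight generations is a 256-fold increase); that reading, the function names, and the CFU counts are illustrative assumptions rather than values from the experiments.

```python
import math


def percent_killing(cfu_preimmune, cfu_immune):
    """[(CFU pre-immune) - (CFU immune)] / (CFU pre-immune) x 100."""
    if cfu_preimmune <= 0:
        raise ValueError("pre-immune CFU count must be positive")
    return 100.0 * (cfu_preimmune - cfu_immune) / cfu_preimmune


def generations(cfu_inoculum, cfu_final):
    """Number of doublings between inoculum and final count (assumed meaning)."""
    if cfu_inoculum <= 0 or cfu_final <= 0:
        raise ValueError("CFU counts must be positive")
    return math.log2(cfu_final / cfu_inoculum)


def assay_is_valid(cfu_inoculum, cfu_preimmune, min_generations=8.0):
    """Accept the assay only if growth in pre-immune serum is sufficient."""
    return generations(cfu_inoculum, cfu_preimmune) >= min_generations


# Hypothetical counts: 10 CFU inoculum, 5120 CFU after three hours with
# pre-immune serum, 256 CFU after three hours with immune serum.
valid = assay_is_valid(10, 5120)
doublings = generations(10, 5120)
killing = percent_killing(5120, 256)
```

Here the pre-immune control grew nine generations, so the assay would be accepted, and the immune serum reduced the count by 95%.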

In all experiments, the test mixture containing pre-immune serum resulted in
growth of the organisms to eight generations or more (data not shown), again indicating
that the human blood did not contain opsonic antibodies against the test strains and that
each organism was fully resistant to bactericidal killing in nonimmune blood. Using 50%
reduction in growth after the three-hour rotation in immune serum compared to the
pre-immune serum (percent killing), bactericidal activity was observed against 22 of the 26
serotypes tested (Figure 9). When the results of the opsonization and bactericidal assays
were combined, 24 of the 26 serotypes (92%) tested were opsonized by the immune sera in
one or both assays.

EXAMPLE 7

Assays to Detect Tissue Cross-Reactive Antibodies in Immunized Rabbits 

Rabbit immune sera raised against a composition comprising a cocktail of
four different hybrid polypeptides (i.e., a 27-valent vaccine) were tested for the presence of
tissue cross-reactive antibodies by indirect immunofluorescence assays (Dale and Beachey,
J. Exp. Med. 161:113, 1985) using frozen sections (4 µm) of human myocardium, kidney,
basal ganglia, cerebral cortex, and cartilage. The sections were placed on gelatin-coated
slides and fixed with 1% paraformaldehyde for 10 min. The slides were washed with PBS,
incubated with the immune sera diluted 1:5 in PBS for 30 minutes at room temperature,
and washed thoroughly in PBS. The sections were then incubated with
fluorescein-conjugated goat anti-rabbit IgG (Cappel, West Chester, PA) at a dilution of 1:40 in PBS for
30 minutes at room temperature. After washing, the slides were mounted in Gelvatol and
examined in a fluorescent microscope. Rabbit anti-sera known to cross-react with human
myocardium, kidney, and brain were used as positive controls and rabbit pre-immune sera
as negative controls.

While the hybrid polypeptides elicited opsonic antibodies to most GrAS 
serotypes tested, none of the hybrid polypeptides elicited human cross-reactive antibodies. 
This indicates that the hybrid polypeptides, alone or in combination, do not contain 
potentially harmful autoimmune epitopes. 

EXAMPLE 8

Bactericidal Activity of Serum from Immunized Rabbits 
Against Other Serotypes 

Type 4 streptococci are relatively common causes of uncomplicated
pharyngitis and invasive infections. Type 4 organisms currently account for 3.8% of all
invasive infections and 8.6% of all pharyngitis isolates in the ongoing U.S. surveillance
program (personal communication, S.T. Shulman). Not wishing to be bound by theory, it
appears that either purified recombinant type 4 M protein does not evoke opsonic
antibodies or the type 4 streptococcal strains are resistant to opsonization. For this reason,
the emm4 gene fragment was not included in the composition comprising the four hybrid
polypeptides of Hexa A.1 [SEQ ID NO:10], Septa B.2 [SEQ ID NO:16], Septa C.2 [SEQ
ID NO:4], and Septa D.1 [SEQ ID NO:14] (i.e., the 27-valent vaccine). To determine
whether any of the antibodies elicited by the 27-valent vaccine might be directed against
cross-reactive opsonic epitopes on the surface of type 4 streptococci, bactericidal assays
were performed using seven clinical isolates obtained from the U.S. Streptococcal
Pharyngitis Surveillance Program (Figure 10).

Interestingly, bactericidal activity was detected against five of the seven
strains of type 4 streptococci. The strains isolated from patients in Florida (FL9) and
Illinois (IL23 and IL12) were opsonized by both of the 27-valent antisera tested. One of
the two antisera opsonized both isolates from California (CA5 and CA12), while the strains
from Connecticut (CT5) and South Dakota (SD32) were not opsonized by either antiserum
(Figure 10). The results suggest that the 27-valent cocktail vaccine can evoke antibodies
that cross-react with protective epitopes on the surface of some strains of type 4
streptococci. In addition, the data indicate that type 4 streptococci may be a heterogeneous
group of organisms that express different protective epitopes even though they all express
the type-specific M4 protein.

EXAMPLE 9 

Formulation of Hybrid Polypeptide Cocktail and Immunization of Humans

The formulated bulk vaccine consists of the four recombinant proteins
(Hexa A.3 [SEQ ID NO:2], Septa B.3a [SEQ ID NO:8], Septa C.2 [SEQ ID NO:4], and
Septa D.3 [SEQ ID NO:6]) adsorbed onto aluminum hydroxide (final alum concentration
of about 1.5 μg/ml) and diluted to a target concentration of about 400 μg/ml or about
800 μg/ml (100 μg/ml or 200 μg/ml, respectively, of each hybrid polypeptide) with
phosphate buffered saline. This cocktail of immunogenic polypeptides represents at least
27 antigens.

Briefly, the calculated volumes of the purified hybrid polypeptides in PBS
are measured out and added to a sterile polystyrene media bottle. The mixture is stirred
until homogeneous, and then diluted with an equal volume of sterile water for injection.
This dilution step reduces the concentration of Na2HPO4 and NaCl in the mixture to the
desired final concentration (5 mM phosphate, 150 mM NaCl, pH 7.5). The pooled
hybrid polypeptides are then passed through a sterilizing filter unit (MILLIPAK® 20,
Millipore, Bedford, MA) using a peristaltic pump. If some peptides require special
conditions to homogenize or dilute, then the solutions are made separately and
subsequently mixed with an adjustment to pH, as needed.

The required volume of recombinant hybrid polypeptides is measured out
and added to a formulation bottle. REHYDRAGEL® (low viscosity, Reheis, Inc., Berkeley
Heights, NJ) is received as a sterile suspension of aluminum hydroxide in water for
injection and is used without further preparation. The required volume of
REHYDRAGEL® is measured out and added to the formulation bottle while stirring. The
pH of the mixture is measured and adjusted to pH 7.5 - 7.7 using 1 M NaOH. Finally,
formulation buffer is added to achieve the correct final volume and the mixture is stirred for
an additional 16-20 hours at room temperature. The bulk vaccine is divided into
sterile containers, samples are taken for testing, and the formulated vaccine is stored at
2-8 °C.
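
The volume calculations implied by this formulation procedure can be sketched as follows. This is an illustrative calculation only: the stock concentrations and the batch volume are hypothetical values chosen for the example, the helper names are not from the source, and the starting salt concentration of 300 mM is inferred from the stated equal-volume dilution to 150 mM.

```python
def total_protein_ug_per_ml(per_polypeptide_ug_per_ml: float,
                            n_polypeptides: int = 4) -> float:
    """Total protein when each hybrid polypeptide is at the same concentration."""
    return per_polypeptide_ug_per_ml * n_polypeptides


def stock_volume_ml(stock_ug_per_ml: float, target_ug_per_ml: float,
                    final_volume_ml: float) -> float:
    """Volume of stock needed, from C1 * V1 = C2 * V2."""
    if stock_ug_per_ml <= 0 or stock_ug_per_ml < target_ug_per_ml:
        raise ValueError("stock must be at least as concentrated as the target")
    return target_ug_per_ml * final_volume_ml / stock_ug_per_ml


def after_equal_volume_dilution(concentration: float) -> float:
    """Diluting with an equal volume of water halves every solute concentration."""
    return concentration / 2.0


if __name__ == "__main__":
    # Hypothetical stock concentrations (ug/ml) and batch size (ml).
    stocks = {"Hexa A.3": 1000.0, "Septa B.3a": 800.0,
              "Septa C.2": 500.0, "Septa D.3": 400.0}
    batch_ml = 1000.0
    for name, conc in stocks.items():
        print(name, stock_volume_ml(conc, 100.0, batch_ml), "ml")
    print("total protein:", total_protein_ug_per_ml(100.0), "ug/ml")
    print("NaCl after dilution:", after_equal_volume_dilution(300.0), "mM")
```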

Human volunteers were screened and 30 healthy subjects aged 18-50 years
were enrolled. Each subject was immunized with 400 μg of the cocktail vaccine
composition via the intramuscular route at 0, 30, and 120 days. Serum was obtained prior
to the first injection and at days 14, 30, 44, 60, 120, 134 and 150.
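
The relationship between the injection schedule and the serum collection days can be made explicit with a short sketch. It assumes, as a labeled simplification, that serum drawn on an injection day (days 0, 30, and 120) is taken before that day's dose is given; the text states this only for the first injection. The function names are hypothetical.

```python
INJECTION_DAYS = (0, 30, 120)
SERUM_DAYS = (0, 14, 30, 44, 60, 120, 134, 150)


def doses_received(day: int, injection_days=INJECTION_DAYS) -> int:
    """Doses given strictly before the draw (assumes same-day draws precede the dose)."""
    return sum(1 for d in injection_days if d < day)


def days_since_last_injection(day: int, injection_days=INJECTION_DAYS):
    """Days elapsed since the most recent earlier dose, or None at baseline."""
    earlier = [d for d in injection_days if d < day]
    return day - max(earlier) if earlier else None


if __name__ == "__main__":
    for day in SERUM_DAYS:
        print(day, doses_received(day), days_since_last_injection(day))
```

Under this assumption, the draws on days 14, 44, and 134 each fall two weeks after a dose.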

EXAMPLE 10 

ELISA Using Serum from Immunized Human Subjects

Type-specific antibodies were detected by ELISA, similar to the methods
described in Example 4 (see also McLellan et al., Infect. Immun. 69:2943, 2001). Briefly,
microtiter wells were coated with purified recombinant dimeric M peptides copying
the vaccine subunits, which were used as solid-phase antigens. Wells without human sera but
containing all other reagents served as negative controls. The ELISAs were performed
using the collected human sera. The sera were serially diluted in PBS (pH 7.4) with 0.1%
BSA and 0.05% TWEEN® 20, added to the wells, and incubated at 37°C for 2 hours. The
wells were washed with PBS-0.05% TWEEN® 20. A horseradish peroxidase-conjugated
goat immunoglobulin G (IgG) to human immunoglobulins (IgG, IgA, and IgM) (Jackson
ImmunoResearch Laboratories, West Grove, PA) diluted 1:2,000 was added and incubated
at 37°C for 1 hour. The wells were then washed. 1-Step™ Turbo TMB-ELISA substrate
(Pierce, Rockford, IL) was added, and 1 N sulfuric acid was added after 30 minutes. The
A450/595 was recorded in a Vmax® microplate reader (Molecular Devices, Sunnyvale, CA).

All of the immune sera from the human subjects (obtained at Day 134,
which was two weeks after the third injection on Day 120) showed a statistically significant
rise in antibody titer for all M antigens and the Spa antigen, with a p value < 0.001 (Figure
11). All subjects had a baseline level pre-immune titer for some antigens (Day 0) as seen
in Figure 11 (Day 0, open bars). The geometric mean fold increase in serum antibody to all
of the antigens in the 27-valent cocktail was 12.6-fold, with a minimum increase of three-
fold and a maximum increase of over 68-fold. Overall, 26 out of the 27 antigens
represented in the vaccine evoked at least a four-fold mean increase in antibody titer.
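
A geometric mean fold increase of the kind reported above can be computed as sketched below. The titers and antigen labels in the example are hypothetical, and the helper names are not from the source. For paired pre- and post-immunization titers, the ratio of geometric mean titers equals the geometric mean of the per-subject fold increases, so either reading of "geometric mean fold increase" gives the same number.

```python
import math


def geometric_mean(values) -> float:
    """Geometric mean of positive titers, computed on the log scale."""
    values = list(values)
    if not values or any(v <= 0 for v in values):
        raise ValueError("titers must be a non-empty sequence of positive numbers")
    return math.exp(sum(math.log(v) for v in values) / len(values))


def fold_increase(pre_titers, post_titers) -> float:
    """Fold rise in geometric mean titer after immunization."""
    return geometric_mean(post_titers) / geometric_mean(pre_titers)


def responders(fold_by_antigen: dict, threshold: float = 4.0) -> list:
    """Antigens whose mean fold increase meets the threshold (four-fold by default)."""
    return [antigen for antigen, fold in fold_by_antigen.items() if fold >= threshold]


if __name__ == "__main__":
    # Hypothetical Day 0 and Day 134 titers for one antigen.
    pre = [200, 400, 800]
    post = [3200, 6400, 12800]
    print(round(fold_increase(pre, post), 2))
```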

EXAMPLE 11

Bactericidal Assays Using Serum from Immunized Human Subjects

Bactericidal assays were performed essentially as described in Example 6
(see also Lancefield, J. Exp. Med. 106:525, 1957). Only those assays that resulted in
growth of the test strain in the presence of Day 0 serum were used to express percent
killing in the presence of Day 134 serum.

All subjects that showed a rise in antibody titers over baseline pre-
immune titers also showed an increase in bactericidal activity over baseline levels
(Figure 12). The geometric mean antibody titer increase was determined as in Example 10
and converted to its natural logarithm (ln) for comparison to the functional activity (bactericidal killing)
of the same immune sera. Thus, there is a quantitative correlation between increased
antibody titers and the ability of antibodies to induce bacterial killing.
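
One way to quantify such a relationship is to log-transform the fold increases and compute a correlation coefficient against percent killing. The sketch below uses a plain Pearson correlation, which is an assumption: the text does not name the statistic used. The data points are hypothetical and exist only to exercise the code.

```python
import math


def pearson_r(xs, ys) -> float:
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for constant input")
    return sxy / math.sqrt(sxx * syy)


if __name__ == "__main__":
    # Hypothetical fold increases in titer and percent killing.
    fold = [2.0, 4.0, 8.0, 16.0]
    killing = [20.0, 40.0, 60.0, 80.0]
    ln_fold = [math.log(f) for f in fold]
    print(round(pearson_r(ln_fold, killing), 3))
```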

EXAMPLE 12

Assays to Detect Tissue Cross-Reactive Antibodies
in Serum from Immunized Human Subjects

Sera collected from human subjects, who were immunized with a 27-valent
vaccine composition comprising a cocktail of four different hybrid polypeptides (Hexa A.3
[SEQ ID NO:2], Septa B.3a [SEQ ID NO:8], Septa C.2 [SEQ ID NO:4], and Septa D.3
[SEQ ID NO:6]), were tested for the presence of tissue cross-reactive antibodies
essentially as described in Example 7.

Similar to the results observed in rabbits, none of the hybrid polypeptides
elicited human cross-reactive antibodies. This indicates that the hybrid polypeptides, alone
or in combination, do not contain potentially harmful human autoimmune epitopes.

EXAMPLE 13 

Comparison of Hexavalent Vaccine versus 27-Valent Cocktail Vaccine
Used to Immunize Human Subjects

A similar human trial was performed using a single hexavalent polypeptide,
Hexa 1.2, which has a structure of M24-M5-M6-M19-M1-M3-M24 (see Dale, Vaccine
17:193, 1999; WO 99/13084). Human volunteers were screened and 11 healthy subjects
were each immunized three times with 100 μg of Hexa 1.2 in the same formulation buffer
as used for the 27-valent vaccine composition comprising a cocktail of four different
hybrid polypeptides (Hexa A.3 [SEQ ID NO:2], Septa B.3a [SEQ ID NO:8], Septa C.2
[SEQ ID NO:4], and Septa D.3 [SEQ ID NO:6]). All six of the group A streptococcal
antigens of Hexa 1.2 (M24, M5, M6, M19, M1, and M3) are represented in two of the four
multivalent polypeptides that comprise the 27-valent vaccine. ELISAs were then
performed on sera from immunized subjects to identify type-specific antibodies for each of
the Hexa 1.2 antigens, essentially as described in Example 10. The geometric mean titers
were calculated using antibody titers from the 11 subjects that received the Hexa 1.2
multivalent polypeptide and using antibody titers from the 30 subjects in the clinical trial of the 27-valent
cocktail of four different multivalent polypeptides, respectively (Figure 13).
The fold-increase in antibody titers represents the fold rise in geometric mean titers after
immunization as compared to the geometric mean titers before immunization.

Surprisingly, the subjects immunized with the 27-valent composition
showed a much greater fold-increase in antibody titers overall than did the subjects who
received the hexavalent composition (Figure 13). The 27-valent composition showed a greater fold-
increase for all M antigens, ranging from about a 1.1-fold to about a 4-fold increase. As noted
above, the increase in antibody titer also correlates with an increase in bactericidal activity.
Hence, the four multivalent peptides together unexpectedly showed a synergistic effect in
evoking an immune response as compared to a single hexavalent peptide.

From the foregoing it will be appreciated that, although specific 
embodiments of the invention have been described herein for purposes of illustration, 
various modifications may be made without deviating from the spirit and scope of the 
invention. Accordingly, the invention is not limited except as by the appended claims.

CLAIMS

1. A hybrid polypeptide, comprising at least six different linked 
immunogenic peptides, wherein each peptide comprises an amino-terminal portion of a 
streptococcal M protein of at least 30 amino acids, and wherein the polypeptide has an amino- 
terminal peptide that is reiterated as a carboxy-terminal peptide and the polypeptide is capable of 
eliciting an immune response against more than one antigen of group A streptococci comprising
at least M5, M6, M14, M19, M24, and M29.

2. The hybrid polypeptide according to claim 1 wherein the amino-terminal
immunogenic peptide of the polypeptide is M24.

3. The hybrid polypeptide according to claim 2 wherein the polypeptide is
recombinant and the immunogenic peptides are linked in tandem, the polypeptide having a
structure of M24-M5-M6-M19-M29-M14-M24.

4. A hybrid polypeptide according to claim 3 wherein the polypeptide
comprises the amino acid sequence of SEQ ID NO:2.

5. A hybrid polypeptide, comprising at least seven different linked 
immunogenic peptides, wherein each peptide comprises an amino-terminal portion of a 
streptococcal M protein of at least 35 amino acids, and wherein the polypeptide has an amino- 
terminal peptide that is reiterated as a carboxy-terminal peptide and the polypeptide is capable of 
eliciting an immune response against more than one antigen of group A streptococci comprising 
at least M2, M11, M22, M33, M43, M59, and M94.

6. The hybrid polypeptide according to claim 5 wherein the amino-terminal
immunogenic peptide of the polypeptide is M2.

7. The hybrid polypeptide according to claim 6 wherein the polypeptide is
recombinant and the immunogenic peptides are linked in tandem, the polypeptide having a
structure of M2-M43-M94-M22-M11-M59-M33-M2.

8. A hybrid polypeptide according to claim 7 wherein the polypeptide
comprises the amino acid sequence of SEQ ID NO:4.

9. A hybrid polypeptide, comprising at least seven different linked 
immunogenic peptides, wherein each peptide comprises an amino-terminal portion of a 
streptococcal M protein of at least 40 amino acids, and wherein the polypeptide has an amino-
terminal peptide that is reiterated as a carboxy-terminal peptide and the polypeptide is capable of
eliciting an immune response against more than one antigen of group A streptococci comprising
at least M75, M76, M77, M89, M92, M101, and M114.

10. The hybrid polypeptide according to claim 9 wherein the amino-terminal
immunogenic peptide of the polypeptide is M89.

11. The hybrid polypeptide according to claim 10 wherein the polypeptide is
recombinant and the immunogenic peptides are linked in tandem, the polypeptide having a
structure of M89-M101-M77-M114-M75-M76-M92-M89.

12. A hybrid polypeptide according to claim 11 wherein the polypeptide
comprises the amino acid sequence of SEQ ID NO:6.

13. A hybrid polypeptide, comprising at least seven different immunogenic 
peptides linked in tandem, wherein each peptide comprises an amino-terminal portion of a

amino-terminal peptide that is reiterated as a carboxy-terminal peptide and the polypeptide is 
capable of eliciting an immune response against more than one antigen of group A streptococci
comprising at least Spa, M1.0, M1.2, M3, M12, M18, and M28.

14. The hybrid polypeptide according to claim 13 wherein the amino-terminal
immunogenic peptide of the polypeptide is M1.0.

15. The hybrid polypeptide according to claim 14 wherein the polypeptide is
recombinant and the immunogenic peptides are linked in tandem, the polypeptide having a
structure of M1.0-M12-Spa-M28-M3-M1.2-M18-M1.0.

16. A hybrid polypeptide according to claim 15 wherein the polypeptide
comprises the amino acid sequence of SEQ ID NO:8.

17. The hybrid polypeptide according to any one of claims 3, 7, 11, and 15
wherein the immunogenic peptides are linked by at least two amino acids encoded by a nucleic
acid sequence that is a restriction enzyme recognition site.

18. The hybrid polypeptide according to claim 17 wherein the nucleic acid
sequence is a restriction enzyme recognition site, the recognition site being at least one of
BamHI, ClaI, EcoRI, HindIII, KpnI, NcoI, NheI, PmlI, PstI, SalI, and XhoI.

19. The hybrid polypeptide according to any one of claims 1-16 wherein the 
polypeptide is capable of eliciting at least one opsonic antibody that is not a tissue cross-reactive 
antibody in a subject. 

20. The hybrid polypeptide according to claim 19 wherein the subject is a 

21. The hybrid polypeptide according to any one of claims 1-16 further
comprising a carboxy-terminal tag. 

22. The hybrid polypeptide according to claim 21 wherein the carboxy-
terminal tag is selected from the group consisting of alkaline phosphatase, β-galactosidase,
hexahistidine, epitope tag DYKDDDDK (SEQ ID NO:18), epitope tag DLYDDDDK
(SEQ ID NO: 19), and GST. 

23. The hybrid polypeptide according to claim 22 wherein the carboxy- 
terminal tag is hexahistidine. 

24. The hybrid polypeptide according to any one of claims 1-16 further
comprising at least one additional carboxy-terminal amino acid. 

25. The hybrid polypeptide according to claim 24 wherein the additional 
carboxy-terminal amino acid is a D-amino acid. 

26. The hybrid polypeptide according to claim 24 wherein the additional 
carboxy-terminal amino acid is cysteine. 

27. The hybrid polypeptide according to claim 23 further comprising at least 
one additional carboxy-terminal amino acid. 

28. The hybrid polypeptide according to claim 27 wherein the additional 
carboxy-terminal amino acid is cysteine. 

NOS:2, 4, 6, or 8.

30. A nucleic acid molecule comprising a sequence encoding a hybrid
polypeptide of SEQ ID NOS:2, 4, 6, or 8.

31. A nucleic acid expression construct comprising an expression control 
sequence operably linked to a polynucleotide encoding a hybrid polypeptide of SEQ ID NOS:2,
4, 6, or 8.

32. The construct according to claim 31 wherein the expression control
sequence is an inducible promoter.

33. The construct according to claim 32 wherein the inducible promoter is
selected from the group consisting of a lac, tac, trc, ara, trp, λ phage, T7 phage, and T5 phage
promoter.

34. The construct according to claim 32 wherein the inducible promoter is a
T5 phage promoter.

35. The construct according to claim 31 wherein the construct comprises a 
nucleic acid expression vector selected from the group comprising a plasmid, phagemid, shuttle 
vector, cosmid, and virus. 

36. The construct according to claim 31 wherein the construct comprises a 
nucleic acid expression vector, wherein the vector is a plasmid. 

37. The construct according to claim 36 wherein the plasmid is pT5 (SEQ ID
NO:17).

38. A host cell containing a construct according to any one of claims 31-37.

39. The host cell of claim 38 wherein the host cell is selected from the group
consisting of a bacterium, a yeast cell, a nematode cell, an insect cell, and a mammalian cell.

40. The host cell of claim 39 wherein the host cell is a bacterium, wherein the
bacterium is Escherichia coli.

41. A method of producing a hybrid polypeptide, comprising culturing a host 
cell containing a nucleic acid expression vector comprising at least one expression control 
sequence operably linked to a nucleic acid molecule encoding a hybrid polypeptide of SEQ ID 
NOS:2, 4, 6, or 8, under conditions and for a time sufficient for expression of the polypeptide. 

42. The method of claim 41 wherein the expression control sequence is an 
inducible promoter. 

43. The method according to claim 42 wherein the inducible promoter is 
selected from the group consisting of a lac, tac, trc, ara, trp, λ phage, T7 phage, and T5 phage
promoter. 

44. The method according to claim 42 wherein the inducible promoter is a T5 
phage promoter. 

45. The method according to claim 42 wherein the nucleic acid expression 
vector is pT5 (SEQ ID NO: 17). 

46. A hybrid polypeptide produced according to the method of claim 45. 

hybrid polypeptide according to any one of claims 1-16, 29, and 46.

48. A composition, comprising a pharmaceutically acceptable carrier and a
mixture of at least two of the hybrid polypeptides according to claims 1, 5, 9, and 13. 

49. The composition according to claim 48 wherein the polypeptides comprise 
immunogenic peptides linked in tandem, wherein said at least two polypeptides comprise a 
structure of M24-M5-M6-M19-M29-M14-M24, M2-M43-M94-M22-M11-M59-M33-M2, M89-
M101-M77-M114-M75-M76-M92-M89, or M1.0-M12-Spa-M28-M3-M1.2-M18-M1.0.

50. A composition, comprising a pharmaceutically acceptable carrier and a 
mixture of the hybrid polypeptide according to claim 13 with at least one of the hybrid 
polypeptides according to any one of claims 1, or 9. 

51. The composition according to claim 50 wherein the polypeptides comprise
immunogenic peptides linked in tandem, wherein the polypeptide according to claim 13
comprises a structure of M1.0-M12-Spa-M28-M3-M1.2-M18-M1.0 and said at least one other
polypeptide comprises a structure of M24-M5-M6-M19-M29-M14-M24, M2-M43-M94-M22-
M11-M59-M33-M2, or M89-M101-M77-M114-M75-M76-M92-M89.

52. A composition, comprising a pharmaceutically acceptable carrier and a 
mixture of at least three of the hybrid polypeptides according to claims 1, 5, 9, and 13. 

53. The composition according to claim 52 wherein the polypeptides comprise 
immunogenic peptides linked in tandem, wherein said at least three polypeptides comprise a 
structure of M24-M5-M6-M19-M29-M14-M24, M2-M43-M94-M22-M11-M59-M33-M2, M89-
M101-M77-M114-M75-M76-M92-M89, or M1.0-M12-Spa-M28-M3-M1.2-M18-M1.0.

54. A composition, comprising a pharmaceutically acceptable carrier and a
mixture of hybrid polypeptides according to claims 4, 8, 12, and 16.

55. The composition according to claim 47 wherein the polypeptides are in
equimolar amounts. 

56. The composition according to claim 47 wherein at least one of the hybrid 
polypeptides includes a Spa immunogenic peptide. 

57. The composition according to claim 47 further comprising an adjuvant. 

58. The composition according to claim 57 wherein the adjuvant is alum or
Freund's.

59. The composition according to claim 54 further comprising an adjuvant, 
wherein said adjuvant is alum. 

60. A method for preventing a microbial infection, comprising administering 
to a subject a composition according to claim 47 at a dose sufficient to elicit antibodies specific
for one or more hybrid polypeptide, wherein the antibodies are opsonic and are not tissue cross-
reactive.

61. The method for preventing an infection according to claim 60 wherein the
microbial infection is a streptococcal infection.

62. The method for preventing an infection according to claim 60 wherein the
streptococcal infection is a group A streptococcal infection. 

63. The method for preventing an infection according to claim 60 wherein the 
hybrid polypeptide is administered by a route selected from the group consisting of enteral,
parenteral, transdermal, transmucosal, and inhalation.

64. The method for preventing an infection according to claim 60 wherein the
composition further comprises an adjuvant.

65. The method for preventing an infection according to claim 64 wherein the
adjuvant is alum or Freund's.

66. The method for preventing an infection according to claim 60 wherein the 
subject is human or an animal. 

67. A plurality of antibodies produced by the method according to claim 60. 

68. The plurality of antibodies according to claim 67 wherein the antibodies 
further comprise at least one antibody specific for a M protein antigen not represented in a hybrid 
polypeptide. 

69. The plurality of antibodies according to claim 68 wherein the M protein 
antigen not represented in a hybrid polypeptide is M4. 

70. A method for treating or preventing a microbial infection, comprising 
administering to a subject a composition comprising a pharmaceutically acceptable carrier and a
plurality of antibodies according to claim 67.

Hexavalent A.3
M24--->

9 18 27 36 45 54

5' ATG GTC GCG ACT CGC TCT CAG ACA GAT ACT CTG GAA AAA GTA CAA GAA CGT GCT



Met Val Ala Thr Arg Ser Gln Thr Asp Thr Leu Glu Lys Val Gln Glu Arg Ala

63 72 81 90 99 108 

GAC AAG TTT GAG ATA GAA AAC AAT ACG TTA AAA CTT AAG AAT AGT GAC TTA AGT



Asp Lys Phe Glu Ile Glu Asn Asn Thr Leu Lys Leu Lys Asn Ser Asp Leu Ser

117 126 135 144 153 162 

TTT AAT AAT AAA GCG TTA AAA GAT CAT AAT GAT GAG TTA ACT GAA GAG TTG AGT



Phe Asn Asn Lys Ala Leu Lys Asp His Asn Asp Glu Leu Thr Glu Glu Leu Ser 

PmlI M5--->

171 180 189 198 207 216 

AAT GCT AAA GAG AAA CTA CGT CAC GTG GCC GTG ACT CGC GGT ACA ATA AAT GAC 



Asn Ala Lys Glu Lys Leu Arg His Val Ala Val Thr Arg Gly Thr Ile Asn Asp

225 234 243 252 261 270 

CCG CAA AGA GCA AAA GAA GCT CTT GAC AAG TAT GAG CTA GAA AAC CAT GAC TTA



Pro Gln Arg Ala Lys Glu Ala Leu Asp Lys Tyr Glu Leu Glu Asn His Asp Leu

BamHI M6--->

279 288 297 306 315 324

AAA ACT AAG GGA TCC CGT GTG TTT CCT CGC GGG ACG GTA GAA AAC CCG GAC AAA



Lys Thr Lys Gly Ser Arg Val Phe Pro Arg Gly Thr Val Glu Asn Pro Asp Lys 

333 342 351 360 369 378 

GCA CGA GAA CTT CTT AAC AAG TAT GAC GTA GAG AAC TCT ATG TTA CAA GCT AAT



Ala Arg Glu Leu Leu Asn Lys Tyr Asp Val Glu Asn Ser Met Leu Gln Ala Asn



FIG. 2A 

NcoI M19--->

387 396 405 414 423 432

AAT GAC AAG TTA CCA TGG CGT GTG CGT TAT ACT CGC CAT ACG CCA GAA GAT AAG



Asn Asp Lys Leu Pro Trp Arg Val Arg Tyr Thr Arg His Thr Pro Glu Asp Lys 

441 450 459 468 477 486 

CTA AAA AAA ATT ATT GAC GAT CTT GAC GCA AAA GAA CAT GAA TTA CAA CAA CAG



Leu Lys Lys Ile Ile Asp Asp Leu Asp Ala Lys Glu His Glu Leu Gln Gln Gln
PstI M29--->

495 504 513 522 531 540 

AAT GAG AAG TTA TCT CTG CAG AAA GTG TAT ATT ACT CGT GGT ATG ACA AAA GAG



Asn Glu Lys Leu Ser Leu Gln Lys Val Tyr Ile Thr Arg Gly Met Thr Lys Glu

549 558 567 576 585 594 

GAC GTA GAA AAA ATT GCT AAC AAC CTT GAC ATA GAA AAC CAT GGG TTA AAA CAA



Asp Val Glu Lys Ile Ala Asn Asn Leu Asp Ile Glu Asn His Gly Leu Lys Gln

KpnI

603 612 621 630 639 648 

CAG AAT GAA CAG TTA TCT ACT GAT AAA CAA GGT CTT GAA GAA CAG AAT GGT ACC



Gln Asn Glu Gln Leu Ser Thr Asp Lys Gln Gly Leu Glu Glu Gln Asn Gly Thr
M14--->

657 666 675 684 693 702 

GAT CGC GTT AGT CGT TCT ATG TCA CGC GAT GAT CTA TTA AAC AGG GCT CAG GAT



Asp Arg Val Ser Arg Ser Met Ser Arg Asp Asp Leu Leu Asn Arg Ala Gln Asp

711 720 729 738 747 756 

CTT GAA GCA AAA AAC CAC GGG TTA GAA CAC CAG AAT ACT AAG TTA TCT ACT GAA



Leu Glu Ala Lys Asn His Gly Leu Glu His Gln Asn Thr Lys Leu Ser Thr Glu



FIG. 2B 



WO 03/065973    PCT/US02/34769

ClaI M24--->

765 774 783 792 801 810

AAT AAA ACG CTT CAA GAA CAA GCA GAA GCA CGC CAG AAA GAA ATC GAT GTC GCG



Asn Lys Thr Leu Gln Glu Gln Ala Glu Ala Arg Gln Lys Glu Ile Asp Val Ala

819 828 837 846 855 864 

ACT CGC TCT CAG ACA GAT ACT CTG GAA AAA GTA CAA GAA CGT GCT GAC AAG TTT



Thr Arg Ser Gln Thr Asp Thr Leu Glu Lys Val Gln Glu Arg Ala Asp Lys Phe

873 882 891 900 909 918 

GAG ATA GAA AAC AAT ACG TTA AAA CTT AAG AAT AGT GAC TTA AGT TTT AAT AAT 



Glu Ile Glu Asn Asn Thr Leu Lys Leu Lys Asn Ser Asp Leu Ser Phe Asn Asn

927 936 945 954 963 972 

AAA GCG TTA AAA GAT CAT AAT GAT GAG TTA ACT GAA GAG TTG AGT AAT GCT AAA



Lys Ala Leu Lys Asp His Asn Asp Glu Leu Thr Glu Glu Leu Ser Asn Ala Lys 

981 990 999 

GAG AAA CTA CGT CAC CAC CAC CAC CAC CAC TGA 3' 



Glu Lys Leu Arg His His His His His His stop 



FIG. 2C 
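
The two-residue linkers marked at each restriction site in the figure can be checked by translating the six-base recognition sequences with the standard genetic code. The sketch below is an illustration only: the recognition sequences are the standard ones for these enzymes, and the table construction and function name are not part of the source.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Recognition sequences of enzymes labeled in the sequence figures.
LINKER_SITES = {
    "PmlI": "CACGTG",
    "BamHI": "GGATCC",
    "NcoI": "CCATGG",
    "PstI": "CTGCAG",
    "KpnI": "GGTACC",
    "ClaI": "ATCGAT",
    "EcoRI": "GAATTC",
    "NheI": "GCTAGC",
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string into one-letter amino acid codes."""
    dna = dna.upper().replace(" ", "")
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


if __name__ == "__main__":
    for enzyme, site in LINKER_SITES.items():
        print(enzyme, site, translate(site))
```

Each site, read in frame, encodes the two-residue linker shown beneath it in the figure, for example His-Val at PmlI and Gly-Ser at BamHI.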

Septavalent B.3a
M1.0--->

9 18 27 36 45 54

5' ATG AAC GGT GAT GGT AAT CCT AGG GAA GTT ATA GAA GAT CTT GCA GCA AAC AAT



Met Asn Gly Asp Gly Asn Pro Arg Glu Val Ile Glu Asp Leu Ala Ala Asn Asn

63 72 81 90 99 108 

CCC GCA ATA CAA AAT ATA CGT TTA CGT CAC GAA AAC AAG GAC TTA AAA GCG AGA



Pro Ala Ile Gln Asn Ile Arg Leu Arg His Glu Asn Lys Asp Leu Lys Ala Arg

(XhoI-SalI) M12--->
117 126 135 144 153 162

TTA GAG AAT GCA ATG GAA GTT GCA GGA AGA GAT TTT AAG AGA GCT CTC GAC GAT



Leu Glu Asn Ala Met Glu Val Ala Gly Arg Asp Phe Lys Arg Ala Leu Asp Asp 

171 180 189 198 207 216 

CAT ACT GAT TTA GTC GCA GAA AAA CAA CGT TTA GAA GAT TTA GGA CAA AAA TTT



His Ser Asp Leu Val Ala Glu Lys Gln Arg Leu Glu Asp Leu Gly Gln Lys Phe

225 234 243 252 261 270 

GAA AGA CTG AAA CAG CGT TCA GAA CTC TAC CTT CAG CAA TAC TAT GAT AAT AAA



Glu Arg Leu Lys Gln Arg Ser Glu Leu Tyr Leu Gln Gln Tyr Tyr Asp Asn Lys

BamHI Spa--->

279 288 297 306 315 324 

TCA AAT GGA TAT AAA GGT GAC TGG TAT GTA CAA CAG TTA GGA TCC GAT TCA GTA 



Ser Asn Gly Tyr Lys Gly Asp Trp Tyr Val Gln Gln Leu Gly Ser Asp Ser Val

333 342 351 360 369 378 

AGT GGA TTA GAG GTG GCA GAC CCC TCT GAT AGT AAG AAA CTT ATT GAA TTA GGT



Ser Gly Leu Glu Val Ala Asp Pro Ser Asp Ser Lys Lys Leu Ile Glu Leu Gly



FIG. 3A 

387 396 405 414 423 432

TTG GCT AAA TAC CTT AAT GAT AAA TTA CCC TTT AAA ACT AAA GAA GAT TCA GAG



Leu Ala Lys Tyr Leu Asn Asp Lys Leu Pro Phe Lys Thr Lys Glu Asp Ser Glu

PstI M28--->

441 450 459 468 477 486

ATT TTA TCA GAG TTA CGT GAT GTA TTA AAA AAT CTG CAG GAG TCT CCA AAA AGT



Ile Leu Ser Glu Leu Arg Asp Val Leu Lys Asn Leu Gln Glu Ser Pro Lys Ser

495 504 513 522 531 540 

ACT GAG ACT TCT GCT AAT GGA GCT GAT AAA TTA GCT GAT GCA TAC AAC ACA TTG



Thr Glu Thr Ser Ala Asn Gly Ala Asp Lys Leu Ala Asp Ala Tyr Asn Thr Leu 

549 558 567 576 585 594 

CTT ACT GAA CAT GAG AAA CTC AGA GAT GAG TAT TAT ACA TTA ATT GAT GCT AAA



Leu Thr Glu His Glu Lys Leu Arg Asp Glu Tyr Tyr Thr Leu Ile Asp Ala Lys

KpnI M3--->

603 612 621 630 639 648 

GAA GAA GAA CCT CGC TAT AAA GCA TTG GGT ACC TTG TTA GAT CAG GTT ACA CAA



Glu Glu Glu Pro Arg Tyr Lys Ala Leu Gly Thr Leu Leu Asp Gln Val Thr Gln

657 666 675 684 693 702 

TTA TAT ACT AAA CAT AAT AGT AAT TAC CAA CAA TAT AAT GCA CAA GCT GGC AGA 



Leu Tyr Thr Lys His Asn Ser Asn Tyr Gln Gln Tyr Asn Ala Gln Ala Gly Arg

711 720 729 738 747 756 

CTT GAC CTG AGA CAA AAG GCT GAA TAT CTA AAA GGC CTT AAT GAT TGG GCT GAG 



Leu Asp Leu Arg Gln Lys Ala Glu Tyr Leu Lys Gly Leu Asn Asp Trp Ala Glu

KpnI M1.2--->
765 774 783 792 801 810

CGC CTG TTA CAA GAG TTA AAT GGT ACC AAC AAT GAT GGT CGT TCT CGT GAC GTT



Arg Leu Leu Gln Glu Leu Asn Gly Thr Asn Asn Asp Gly Arg Ser Arg Asp Val



FIG. 3B 

819 828 837 846 855 864

ACG GAA GAG ATT GCA GCA AAC AAT ACC ACA GTA CAA AAT ATA CGT TTA CGT AAC



Thr Glu Glu Ile Ala Ala Asn Asn Thr Thr Val Gln Asn Ile Arg Leu Arg Asn

873 882 891 900 909 918 

GAA AAC AAG AAC TTA AAA GCG AAA AAC GAG GAC TTA GAA GCG AGA TTA GAG AAT



Glu Asn Lys Asn Leu Lys Ala Lys Asn Glu Asp Leu Glu Ala Arg Leu Glu Asn

EcoRI M18--->

927 936 945 954 963 972 

GCA ATG AAT GTT GCA GGA ^ GAT TTT AAG CGT GCT GAA TTC GCA CCT CTT ACT



Ala Met Asn Val Ala Gly Arg Asp Phe Lys Arg Ala Glu Phe Ala Pro Leu Thr

981 990 999 1008 1017 1026 

CGT GCT ACA GCA GAC AAT AAA GAC GAA TTA ATA AAA AGA GCT AAC GGT TAT GAG



Arg Ala Thr Ala Asp Asn Lys Asp Glu Leu Ile Lys Arg Ala Asn Gly Tyr Glu

1035 1044 1053 1062 1071 1080 

ATA CAG AAC CAT CAG TTA ACA GTT GAG AAT AAA AAA TTA AAA ACT GAT AAG GAA



Ile Gln Asn His Gln Leu Thr Val Glu Asn Lys Lys Leu Lys Thr Asp Lys Glu

PmlI M1.0--->
1089 1098 1107 1116 1125 1134 

CAG TTA ACA AAA GAG AAT GAT GAT TTA AAA CAC GTG AAC GGT GAT GGT AAT CCT



Gln Leu Thr Lys Glu Asn Asp Asp Leu Lys His Val Asn Gly Asp Gly Asn Pro

1143 1152 1161 1170 1179 1188 

CGT GAA GTT ATA GAA GAT CTT GCA GCA AAC AAT CCC GCA ATA CAA AAT ATA CGT



Arg Glu Val Ile Glu Asp Leu Ala Ala Asn Asn Pro Ala Ile Gln Asn Ile Arg

1197 1206 1215 1224 1233 1242

TTA CGT CAC GAA AAC AAG GAC TTA AAA GCG AGA TTA GAG AAT GCA ATG GAA GTT



Leu Arg His Glu Asn Lys Asp Leu Lys Ala Arg Leu Glu Asn Ala Met Glu Val 



FIG. 3C 

1251 1260 1269 1278 1287

GCA GGA CGT GAT TTT AAG CGT GCT CAC CAC CAC CAC CAC CAC TAA 3'



Ala Gly Arg Asp Phe Lys Arg Ala His His His His His His stop 



FIG. 3D 
Septavalent C.2
M2— >

9 18 27 36 45 54

5' ATG AGT AAG AAC CCT GTC CCT GTC AAA AAA GAA GCA AAA TTA AGT GAA GCA GAA

Met Ser Lys Asn Pro Val Pro Val Lys Lys Glu Ala Lys Leu Ser Glu Ala Glu

63 72 81 90 99 108

TTA CAT GAC AAA ATT AAA AAC CTT GAA GAG GAA AAA GCA GAA TTA TTC GAG AAA

Leu His Asp Lys Ile Lys Asn Leu Glu Glu Glu Lys Ala Glu Leu Phe Glu Lys

(XhoI-SalI)M43— >

117 126 135 144 153 162

CTC GAC GAA GAA CAC CCT GAC GTT GTC GCT GCT AGA GAA AGC GTA CTA AAT AAT

Leu Asp Glu Glu His Pro Asp Val Val Ala Ala Arg Glu Ser Val Leu Asn Asn

171 180 189 198 207 216

GTC CGT GTA CCG GGT ACA CTT TGG CTA CGT CAA AAA GAA GAA AAT GAC AAA CTT

Val Arg Val Pro Gly Thr Leu Trp Leu Arg Gln Lys Glu Glu Asn Asp Lys Leu

NheI

225 234 243 252 261 270

AAA TTG GAA AAG AAA GGG CTT GAG ACT GAG TTA CAG GAA AAG GAA CAA GCT AGC

Lys Leu Glu Lys Lys Gly Leu Glu Thr Glu Leu Gln Glu Lys Glu Gln Ala Ser
M94— >

279 288 297 306 315 324

GAA GAA GCA TCA AAT AAT GGG CAA CTC ACA TTA CAG CAT AAA AAT AAT GCA TTG

Glu Glu Ala Ser Asn Asn Gly Gln Leu Thr Leu Gln His Lys Asn Asn Ala Leu

333 342 351 360 369 378

ACT AGT GAG AAT GAG TCT CTT CGT CGT GAA AAA GAT CGT TAT TTG TAT GAA AAA

Thr Ser Glu Asn Glu Ser Leu Arg Arg Glu Lys Asp Arg Tyr Leu Tyr Glu Lys



FIG. 4A 
BamHI m— >

387 396 405 414 423 432

GAA GAA TTA GAA GGA TCC GAG TCA TCA AAT AAT GCG GAG TCA TCA AAC ATT TCT

Glu Glu Leu Glu Gly Ser Glu Ser Ser Asn Asn Ala Glu Ser Ser Asn Ile Ser

441 450 459 468 477 486

CAA GAA AGC AAA CTA ATA AAT ACA TTG ACT GAT GAA AAT GAG AAA CTC AGA GAA

Gln Glu Ser Lys Leu Ile Asn Thr Leu Thr Asp Glu Asn Glu Lys Leu Arg Glu

495 504 513 522 531 540

GAG CTC CAA CAG TAT TAT GCA TTA AGT GAT GCT AAA GAA GAA GAA CCT CGT TAT

Glu Leu Gln Gln Tyr Tyr Ala Leu Ser Asp Ala Lys Glu Glu Glu Pro Arg Tyr
PstI M11— >

549 558 567 576 585 594

AAA GCA CTG CAG ACT GAA GTT AAG GCT GCG GGG CAA AGC GCT CCT AAA GGT ACA

Lys Ala Leu Gln Thr Glu Val Lys Ala Ala Gly Gln Ser Ala Pro Lys Gly Thr

603 612 621 630 639 648

AAC GTG AGC GCA GAC CTA TAT AAT TCG CTA TGG GAT GAA AAT AAA ACT CTT AGA

Asn Val Ser Ala Asp Leu Tyr Asn Ser Leu Trp Asp Glu Asn Lys Thr Leu Arg

657 666 675 684 693 702

GAA AAA CAA GAA GAG TAT ATA ACA AAA ATT CAA AAT GAA GAG ACA AAA AAT AAA

Glu Lys Gln Glu Glu Tyr Ile Thr Lys Ile Gln Asn Glu Glu Thr Lys Asn Lys
KpnI M59— >

711 720 729 738 747 756

GGT ACC GAA CAA GCA AAA AAT AAT AAT GGG GAA CTC ACA TTA CAG CAA AAA TAC

Gly Thr Glu Gln Ala Lys Asn Asn Asn Gly Glu Leu Thr Leu Gln Gln Lys Tyr

765 774 783 792 801 810

GAT GCA TTG ACT AAT GAG AAT AAG TCT CTT CGT CGT GAG CGT GAT AAC TAT TTA

Asp Ala Leu Thr Asn Glu Asn Lys Ser Leu Arg Arg Glu Arg Asp Asn Tyr Leu

FIG. 4B 
NcoI M33— >

819 828 837 846 855 864

AAT TAT TTA TAT GAA AAA CCA TGG GAA GAG CAT GAA AAA GTA ACA CAA GCC AGA

Asn Tyr Leu Tyr Glu Lys Pro Trp Glu Glu His Glu Lys Val Thr Gln Ala Arg

873 882 891 900 909 918

GAA GCG GTT ATC AGA GAG ATG CAA CAG AGG GGG ACA AAT TTT GGA CCT CTG TTA

Glu Ala Val Ile Arg Glu Met Gln Gln Arg Gly Thr Asn Phe Gly Pro Leu Leu

PmlI

927 936 945 954 963 972

GCA AGT ACA ATG CGA GAT AAT CAC AAT TTA AAA GAA ACG CTT GAC AAA ACT CAC

Ala Ser Thr Met Arg Asp Asn His Asn Leu Lys Glu Thr Leu Asp Lys Thr His
M2— >

981 990 999 1008 1017 1026

GTG AGT AAG AAC CCT GTC CCT GTC AAA AAA GAA GCA AAA TTA AGT GAA GCA GAA

Val Ser Lys Asn Pro Val Pro Val Lys Lys Glu Ala Lys Leu Ser Glu Ala Glu

1035 1044 1053 1062 1071 1080

TTA CAT GAC AAA ATT AAA AAC CTT GAA GAG GAA AAA GCA GAA TTA TTC GAG AAA

Leu His Asp Lys Ile Lys Asn Leu Glu Glu Glu Lys Ala Glu Leu Phe Glu Lys

1089 1098 1107 

CTC GAG CAC CAC CAC CAC CAC CAC TGA 3' 



Leu Glu His His His His His His stop 



FIG. 4C 
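
The enzyme labels in FIG. 4A to FIG. 4C (NheI, BamHI, PstI, KpnI, NcoI, PmlI) mark restriction sites at the junctions between M protein fragments. The following is a minimal Python sketch, not part of the original disclosure, of how such a label can be checked against the nucleotide sequence. The function name `find_sites` and the table `SITES` are hypothetical helpers introduced for illustration; the recognition sequences are the standard ones for these enzymes, and the fragment is the FIG. 4A row numbered 225 to 270 (first nucleotide 217).

```python
# Standard recognition sequences for the enzymes labelled in the figures.
SITES = {
    "BamHI": "GGATCC",
    "KpnI": "GGTACC",
    "NheI": "GCTAGC",
    "EcoRI": "GAATTC",
    "PstI": "CTGCAG",
    "HindIII": "AAGCTT",
    "NcoI": "CCATGG",
    "PmlI": "CACGTG",
}


def find_sites(dna, first_nt=1):
    """Return (enzyme, position) pairs for every site found in dna.

    Spaces between codons are ignored. Positions are 1-based and offset by
    first_nt, the figure numbering of the first nucleotide in the fragment.
    """
    dna = dna.replace(" ", "").upper()
    hits = []
    for name, site in SITES.items():
        start = dna.find(site)
        while start != -1:
            hits.append((name, first_nt + start))
            start = dna.find(site, start + 1)
    return sorted(hits, key=lambda hit: hit[1])


# FIG. 4A row ending at nucleotide 270 (labelled NheI in the figure).
FRAGMENT = ("AAA TTG GAA AAG AAA GGG CTT GAG ACT GAG TTA CAG "
            "GAA AAG GAA CAA GCT AGC")

print(find_sites(FRAGMENT, first_nt=217))
```

Under these assumptions the only site reported in the fragment is NheI, starting at nucleotide 265, which agrees with the label placed above that row.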
Septavalent D.3
M89— >

9 18 27 36 45 54

5' ATG AGT GAC AAT ATT AAT CGT TCT GTC TCT GTC AAA GAT AAT GAA AAA GAA TTA

Met Ser Asp Asn Ile Asn Arg Ser Val Ser Val Lys Asp Asn Glu Lys Glu Leu

63 72 81 90 99 108

CAT AAC AAA ATT GCA GAC CTT GAA GAG GAA AGG GGT GAA CAT CTA GAC AAA ATA

His Asn Lys Ile Ala Asp Leu Glu Glu Glu Arg Gly Glu His Leu Asp Lys Ile

BamHI M101— >

117 126 135 144 153 162

GAT GAA CTA AAA GAA GAA CTA AAA GCA AAG GAA AAA AGT TCA GGA TCC GCT GAT

Asp Glu Leu Lys Glu Glu Leu Lys Ala Lys Glu Lys Ser Ser Gly Ser Ala Asp

171 180 189 198 207 216

CAC CCT AGC TAT ACC GCT GCT AAA GAT GAA GTA CTA AGT AAG TTC TCT GTA CCG

His Pro Ser Tyr Thr Ala Ala Lys Asp Glu Val Leu Ser Lys Phe Ser Val Pro

225 234 243 252 261 270

GGT CAT GTT TGG GCA CAT GAA AGA GAA AAA AAT GAC AAA CTT AGC TCG GAA AAT

Gly His Val Trp Ala His Glu Arg Glu Lys Asn Asp Lys Leu Ser Ser Glu Asn

NheI M77— >

279 288 297 306 315 324

GAA GGG CTT AAG GCT GGT TTA CAG GAA AAG GAA CAA GCT AGC GAA GGG GTT TCT

Glu Gly Leu Lys Ala Gly Leu Gln Glu Lys Glu Gln Ala Ser Glu Gly Val Ser

333 342 351 360 369 378

GTA GGT TCA GAT GCA TCA CTA CAT AAC CGC ATT ACA GAC CTT GAA GAG GAA AGA

Val Gly Ser Asp Ala Ser Leu His Asn Arg Ile Thr Asp Leu Glu Glu Glu Arg



FIG. 5A 
387 396 405 414 423 432

GAA AAA TTA TTA AAT AAA TTA GAT AAA GTT GAA GAA GAG CAT AAA AAA GAT CAT

Glu Lys Leu Leu Asn Lys Leu Asp Lys Val Glu Glu Glu His Lys Lys Asp His
KpnI M114— >

441 450 459 468 477 486

GAA CAA GGT ACC AAC AGT AAG AAC CCT GCC CCT GCC CCT GCC TCT GCT GTC CCT

Glu Gln Gly Thr Asn Ser Lys Asn Pro Ala Pro Ala Pro Ala Ser Ala Val Pro

495 504 513 522 531 540

GTC AAA AAA GAA GCA ACA AAA TTA AGT GAA GCA GAA TTA TAT AAC AAA ATT CAA

Val Lys Lys Glu Ala Thr Lys Leu Ser Glu Ala Glu Leu Tyr Asn Lys Ile Gln

BamHI M75— >

549 558 567 576 585 594

GAA CTT GAA GAG GGA AAA GCA GAA TTA TTC GGA TCC GAA GAA GAA CGT ACT TTT

Glu Leu Glu Glu Gly Lys Ala Glu Leu Phe Gly Ser Glu Glu Glu Arg Thr Phe

603 612 621 630 639 648

ACT GAG TTA CCA TAT GAA GCA CGA TAC AAA GCA TGG AAA AGT GAA AAT GAT GAG

Thr Glu Leu Pro Tyr Glu Ala Arg Tyr Lys Ala Trp Lys Ser Glu Asn Asp Glu

657 666 675 684 693 702

CTT CGG GAA AAT TAT CGT CGT ACC TTA GAT AAG TTT AAT ACT GAG CAA GGT AAG

Leu Arg Glu Asn Tyr Arg Arg Thr Leu Asp Lys Phe Asn Thr Glu Gln Gly Lys

HindIII M76— >

711 720 729 738 747 756

ACT ACG CGC TTA GAA GAA CAA AAT AAG CTT GCG GAC GCG AAC TCG AAA AGC GTT

Thr Thr Arg Leu Glu Glu Gln Asn Lys Leu Ala Asp Ala Asn Ser Lys Ser Val

765 774 783 792 801 810

TCT AAT AGT AAC GTG AGC ATA AAT CTA TAT AAT GAG CTA CAG GCT GAA CAT GAT

Ser Asn Ser Asn Val Ser Ile Asn Leu Tyr Asn Glu Leu Gln Ala Glu His Asp

FIG. 5B 
819 828 837 846 855 864

AAG CTA CAG ACT AAA CAT GAG GAG CTA TTG GCT GAA CAT GAT GCT CTT AAA GAA

Lys Leu Gln Thr Lys His Glu Glu Leu Leu Ala Glu His Asp Ala Leu Lys Glu

EcoRI M92— >

873 882 891 900 909 918

AAA CAA GAT AAA AAT CAA GAA TTC GAT GAC CGG AGC GTT TCT ACT AAT AGT GGT

Lys Gln Asp Lys Asn Gln Glu Phe Asp Asp Arg Ser Val Ser Thr Asn Ser Gly

927 936 945 954 963 972

AGC GTG AGC ACA CCA TAT AAT AAC CTA TTG AAT GAA TAT GAT GAC CTA TTG GCT

Ser Val Ser Thr Pro Tyr Asn Asn Leu Leu Asn Glu Tyr Asp Asp Leu Leu Ala

981 990 999 1008 1017 1026

AAA CAT GGT GAG CTA TTG AGT GAA TAT GAT GCT CTT AAA GAA AAA CAA GAT AAA

Lys His Gly Glu Leu Leu Ser Glu Tyr Asp Ala Leu Lys Glu Lys Gln Asp Lys
NheI M89— >

1035 1044 1053 1062 1071 1080

AAT CAA GAA GCT AGC AGT GAC AAT ATT AAT CGT TCT GTC TCT GTC AAA GAT AAT

Asn Gln Glu Ala Ser Ser Asp Asn Ile Asn Arg Ser Val Ser Val Lys Asp Asn

1089 1098 1107 1116 1125 1134

GAA AAA GAA TTA CAT AAC AAA ATT GCA GAC CTT GAA GAG GAA AGG GGT GAA CAT

Glu Lys Glu Leu His Asn Lys Ile Ala Asp Leu Glu Glu Glu Arg Gly Glu His

1143 1152 1161 1170 1179 1188

CTA GAC AAA ATA GAT GAA CTA AAA GAA GAA CTA AAA GCA AAG GAA AAA AGT TCA

Leu Asp Lys Ile Asp Glu Leu Lys Glu Glu Leu Lys Ala Lys Glu Lys Ser Ser

1197 1206
CAC CAC CAC CAC CAC CAC TGA 3'

His His His His His His stop

FIG. 5C
SEQUENCE LISTING

<110> ID Biomedical Corporation of Washington 

University of Tennessee Research Corporation 
Reddish, Mark A.
Hu, Mary C.
Walls, Michael A. 
Dale, James B. 



<120> MULTIVALENT STREPTOCOCCAL VACCINE COMPOSITIONS AND 
METHODS FOR USE 

<130> 481112.413PC

<140> PCT 

<141> 2002-10-28 

<160> 19 

<170> PatentIn Ver. 2.1 

<210> 1 
<211> 1005 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)..(1002)

<220> 

<223> Description of Artificial Sequence: Nucleic acid 
encoding Hybrid of Group A Streptococci M protein 
and M-like protein 

<400> 1 

atg gtc gcg act cgc tct cag aca gat act ctg gaa aaa gta caa gaa 48
Met Val Ala Thr Arg Ser Gln Thr Asp Thr Leu Glu Lys Val Gln Glu
1 5 10 15

cgt gct gac aag ttt gag ata gaa aac aat acg tta aaa ctt aag aat 96
Arg Ala Asp Lys Phe Glu Ile Glu Asn Asn Thr Leu Lys Leu Lys Asn
20 25 30

agt gac tta agt ttt aat aat aaa gcg tta aaa gat cat aat gat gag 144
Ser Asp Leu Ser Phe Asn Asn Lys Ala Leu Lys Asp His Asn Asp Glu
35 40 45

tta act gaa gag ttg agt aat gct aaa gag aaa cta cgt cac gtg gcc 192
Leu Thr Glu Glu Leu Ser Asn Ala Lys Glu Lys Leu Arg His Val Ala
50 55 60

gtg act cgc ggt aca ata aat gac ccg caa aga gca aaa gaa gct ctt 240
Val Thr Arg Gly Thr Ile Asn Asp Pro Gln Arg Ala Lys Glu Ala Leu
65 70 75 80

gac aag tat gag cta gaa aac cat gac tta aaa act aag gga tcc cgt 288
Asp Lys Tyr Glu Leu Glu Asn His Asp Leu Lys Thr Lys Gly Ser Arg
85 90 95

gtg ttt cct cgc ggg acg gta gaa aac ccg gac aaa gca cga gaa ctt 336
Val Phe Pro Arg Gly Thr Val Glu Asn Pro Asp Lys Ala Arg Glu Leu
100 105 110

ctt aac aag tat gac gta gag aac tct atg tta caa gct aat aat gac 384
Leu Asn Lys Tyr Asp Val Glu Asn Ser Met Leu Gln Ala Asn Asn Asp
115 120 125

aag tta cca tgg cgt gtg cgt tat act cgc cat acg cca gaa gat aag 432
Lys Leu Pro Trp Arg Val Arg Tyr Thr Arg His Thr Pro Glu Asp Lys
130 135 140

cta aaa aaa att att gac gat ctt gac gca aaa gaa cat gaa tta caa 480
Leu Lys Lys Ile Ile Asp Asp Leu Asp Ala Lys Glu His Glu Leu Gln
145 150 155 160

caa cag aat gag aag tta tct ctg cag aaa gtg tat att act cgt ggt 528
Gln Gln Asn Glu Lys Leu Ser Leu Gln Lys Val Tyr Ile Thr Arg Gly
165 170 175

atg aca aaa gag gac gta gaa aaa att gct aac aac ctt gac ata gaa 576
Met Thr Lys Glu Asp Val Glu Lys Ile Ala Asn Asn Leu Asp Ile Glu
180 185 190

aac cat ggg tta aaa caa cag aat gaa cag tta tct act gat aaa caa 624
Asn His Gly Leu Lys Gln Gln Asn Glu Gln Leu Ser Thr Asp Lys Gln
195 200 205

ggt ctt gaa gaa cag aat ggt acc gat cgc gtt agt cgt tct atg tca 672
Gly Leu Glu Glu Gln Asn Gly Thr Asp Arg Val Ser Arg Ser Met Ser
210 215 220

cgc gat gat cta tta aac agg gct cag gat ctt gaa gca aaa aac cac 720
Arg Asp Asp Leu Leu Asn Arg Ala Gln Asp Leu Glu Ala Lys Asn His
225 230 235 240

ggg tta gaa cac cag aat act aag tta tct act gaa aat aaa acg ctt 768
Gly Leu Glu His Gln Asn Thr Lys Leu Ser Thr Glu Asn Lys Thr Leu
245 250 255

caa gaa caa gca gaa gca cgc cag aaa gaa atc gat gtc gcg act cgc 816
Gln Glu Gln Ala Glu Ala Arg Gln Lys Glu Ile Asp Val Ala Thr Arg
260 265 270

tct cag aca gat act ctg gaa aaa gta caa gaa cgt gct gac aag ttt 864
Ser Gln Thr Asp Thr Leu Glu Lys Val Gln Glu Arg Ala Asp Lys Phe
275 280 285

gag ata gaa aac aat acg tta aaa ctt aag aat agt gac tta agt ttt 912
Glu Ile Glu Asn Asn Thr Leu Lys Leu Lys Asn Ser Asp Leu Ser Phe
290 295 300

aat aat aaa gcg tta aaa gat cat aat gat gag tta act gaa gag ttg 960
Asn Asn Lys Ala Leu Lys Asp His Asn Asp Glu Leu Thr Glu Glu Leu
305 310 315 320

agt aat gct aaa gag aaa cta cgt cac cac cac cac cac cac tga 1005
Ser Asn Ala Lys Glu Lys Leu Arg His His His His His His
325 330



<210> 2 
<211> 334 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hybrid of Group A Streptococci M 
protein and M-like protein 

<400> 2 

Met Val Ala Thr Arg Ser Gln Thr Asp Thr Leu Glu Lys Val Gln Glu
1 5 10 15

Arg Ala Asp Lys Phe Glu Ile Glu Asn Asn Thr Leu Lys Leu Lys Asn
20 25 30

Ser Asp Leu Ser Phe Asn Asn Lys Ala Leu Lys Asp His Asn Asp Glu
35 40 45

Leu Thr Glu Glu Leu Ser Asn Ala Lys Glu Lys Leu Arg His Val Ala
50 55 60

Val Thr Arg Gly Thr Ile Asn Asp Pro Gln Arg Ala Lys Glu Ala Leu
65 70 75 80

Asp Lys Tyr Glu Leu Glu Asn His Asp Leu Lys Thr Lys Gly Ser Arg
85 90 95

Val Phe Pro Arg Gly Thr Val Glu Asn Pro Asp Lys Ala Arg Glu Leu
100 105 110

Leu Asn Lys Tyr Asp Val Glu Asn Ser Met Leu Gln Ala Asn Asn Asp
115 120 125

Lys Leu Pro Trp Arg Val Arg Tyr Thr Arg His Thr Pro Glu Asp Lys
130 135 140

Leu Lys Lys Ile Ile Asp Asp Leu Asp Ala Lys Glu His Glu Leu Gln
145 150 155 160

Gln Gln Asn Glu Lys Leu Ser Leu Gln Lys Val Tyr Ile Thr Arg Gly
165 170 175

Met Thr Lys Glu Asp Val Glu Lys Ile Ala Asn Asn Leu Asp Ile Glu
180 185 190

Asn His Gly Leu Lys Gln Gln Asn Glu Gln Leu Ser Thr Asp Lys Gln
195 200 205

Gly Leu Glu Glu Gln Asn Gly Thr Asp Arg Val Ser Arg Ser Met Ser
210 215 220

Arg Asp Asp Leu Leu Asn Arg Ala Gln Asp Leu Glu Ala Lys Asn His
225 230 235 240

Gly Leu Glu His Gln Asn Thr Lys Leu Ser Thr Glu Asn Lys Thr Leu
245 250 255

Gln Glu Gln Ala Glu Ala Arg Gln Lys Glu Ile Asp Val Ala Thr Arg
260 265 270

Ser Gln Thr Asp Thr Leu Glu Lys Val Gln Glu Arg Ala Asp Lys Phe
275 280 285

Glu Ile Glu Asn Asn Thr Leu Lys Leu Lys Asn Ser Asp Leu Ser Phe
290 295 300

Asn Asn Lys Ala Leu Lys Asp His Asn Asp Glu Leu Thr Glu Glu Leu
305 310 315 320

Ser Asn Ala Lys Glu Lys Leu Arg His His His His His His
325 330 
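
The length fields given for SEQ ID NO:1 (<211> 1005, CDS from 1 to 1002) and SEQ ID NO:2 (<211> 334) can be cross-checked with simple arithmetic. The following Python sketch is illustrative only and is not part of the original listing; the function name `check_cds` is a hypothetical helper, and the check assumes that the nucleotides after the CDS end are the stop codon.

```python
def check_cds(total_nt, cds_start, cds_end, protein_len):
    """Cross-check nucleotide and protein lengths from a sequence listing.

    total_nt    -- <211> value of the DNA entry
    cds_start   -- first position of the <222> CDS range
    cds_end     -- last position of the <222> CDS range
    protein_len -- <211> value of the matching PRT entry
    """
    coding_nt = cds_end - cds_start + 1
    return {
        # The CDS must consist of complete codons.
        "whole_codons": coding_nt % 3 == 0,
        # One codon per amino acid residue.
        "protein_len_matches": coding_nt // 3 == protein_len,
        # Nucleotides left after the CDS (3 for a single stop codon).
        "stop_codon_nt": total_nt - cds_end,
    }


# Values taken from the SEQ ID NO:1 and SEQ ID NO:2 headers.
result = check_cds(total_nt=1005, cds_start=1, cds_end=1002, protein_len=334)
print(result)
```

For these values the coding region is 1002 nucleotides, or 334 codons, and the 3 nucleotides that remain correspond to the terminal tga stop codon.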



<210> 3 
<211> 1107 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)..(1104)

<220> 

<223> Description of Artificial Sequence: Hybrid of 

Group A Streptococci M protein and M-like protein 

<400> 3 

atg agt aag aac cct gtc cct gtc aaa aaa gaa gca aaa tta agt gaa 48
Met Ser Lys Asn Pro Val Pro Val Lys Lys Glu Ala Lys Leu Ser Glu
1 5 10 15

gca gaa tta cat gac aaa att aaa aac ctt gaa gag gaa aaa gca gaa 96
Ala Glu Leu His Asp Lys Ile Lys Asn Leu Glu Glu Glu Lys Ala Glu
20 25 30

tta ttc gag aaa ctc gac gaa gaa cac cct gac gtt gtc gct gct aga 144
Leu Phe Glu Lys Leu Asp Glu Glu His Pro Asp Val Val Ala Ala Arg
35 40 45

gaa agc gta cta aat aat gtc cgt gta ccg ggt aca ctt tgg cta cgt 192
Glu Ser Val Leu Asn Asn Val Arg Val Pro Gly Thr Leu Trp Leu Arg
50 55 60

caa aaa gaa gaa aat gac aaa ctt aaa ttg gaa aag aaa ggg ctt gag 240
Gln Lys Glu Glu Asn Asp Lys Leu Lys Leu Glu Lys Lys Gly Leu Glu
65 70 75 80

act gag tta cag gaa aag gaa caa gct agc gaa gaa gca tca aat aat 288
Thr Glu Leu Gln Glu Lys Glu Gln Ala Ser Glu Glu Ala Ser Asn Asn
85 90 95

ggg caa ctc aca tta cag cat aaa aat aat gca ttg act agt gag aat 336
Gly Gln Leu Thr Leu Gln His Lys Asn Asn Ala Leu Thr Ser Glu Asn
100 105 110

gag tct ctt cgt cgt gaa aaa gat cgt tat ttg tat gaa aaa gaa gaa 384
Glu Ser Leu Arg Arg Glu Lys Asp Arg Tyr Leu Tyr Glu Lys Glu Glu
115 120 125

tta gaa gga tcc gag tca tca aat aat gcg gag tca tca aac att tct 432
Leu Glu Gly Ser Glu Ser Ser Asn Asn Ala Glu Ser Ser Asn Ile Ser
130 135 140

caa gaa agc aaa cta ata aat aca ttg act gat gaa aat gag aaa ctc 480
Gln Glu Ser Lys Leu Ile Asn Thr Leu Thr Asp Glu Asn Glu Lys Leu
145 150 155 160

aga gaa gag ctc caa cag tat tat gca tta agt gat gct aaa gaa gaa 528
Arg Glu Glu Leu Gln Gln Tyr Tyr Ala Leu Ser Asp Ala Lys Glu Glu
165 170 175

gaa cct cgt tat aaa gca ctg cag act gaa gtt aag gct gcg ggg caa 576
Glu Pro Arg Tyr Lys Ala Leu Gln Thr Glu Val Lys Ala Ala Gly Gln
180 185 190

agc gct cct aaa ggt aca aac gtg agc gca gac cta tat aat tcg cta 624
Ser Ala Pro Lys Gly Thr Asn Val Ser Ala Asp Leu Tyr Asn Ser Leu
195 200 205

tgg gat gaa aat aaa act ctt aga gaa aaa caa gaa gag tat ata aca 672
Trp Asp Glu Asn Lys Thr Leu Arg Glu Lys Gln Glu Glu Tyr Ile Thr
210 215 220

aaa att caa aat gaa gag aca aaa aat aaa ggt acc gaa caa gca aaa 720
Lys Ile Gln Asn Glu Glu Thr Lys Asn Lys Gly Thr Glu Gln Ala Lys
225 230 235 240

aat aat aat ggg gaa ctc aca tta cag caa aaa tac gat gca ttg act 768
Asn Asn Asn Gly Glu Leu Thr Leu Gln Gln Lys Tyr Asp Ala Leu Thr
245 250 255

aat gag aat aag tct ctt cgt cgt gag cgt gat aac tat tta aat tat 816
Asn Glu Asn Lys Ser Leu Arg Arg Glu Arg Asp Asn Tyr Leu Asn Tyr
260 265 270

tta tat gaa aaa cca tgg gaa gag cat gaa aaa gta aca caa gcc aga 864
Leu Tyr Glu Lys Pro Trp Glu Glu His Glu Lys Val Thr Gln Ala Arg
275 280 285

gaa gcg gtt atc aga gag atg caa cag agg ggg aca aat ttt gga cct 912
Glu Ala Val Ile Arg Glu Met Gln Gln Arg Gly Thr Asn Phe Gly Pro
290 295 300

ctg tta gca agt aca atg cga gat aat cac aat tta aaa gaa acg ctt 960
Leu Leu Ala Ser Thr Met Arg Asp Asn His Asn Leu Lys Glu Thr Leu
305 310 315 320

gac aaa act cac gtg agt aag aac cct gtc cct gtc aaa aaa gaa gca 1008
Asp Lys Thr His Val Ser Lys Asn Pro Val Pro Val Lys Lys Glu Ala
325 330 335

aaa tta agt gaa gca gaa tta cat gac aaa att aaa aac ctt gaa gag 1056
Lys Leu Ser Glu Ala Glu Leu His Asp Lys Ile Lys Asn Leu Glu Glu
340 345 350

gaa aaa gca gaa tta ttc gag aaa ctc gag cac cac cac cac cac cac 1104
Glu Lys Ala Glu Leu Phe Glu Lys Leu Glu His His His His His His
355 360 365

tga 1107 



<210> 4 
<211> 368 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hybrid of Group A Streptococci M 
protein and M-like protein 

<400> 4 

Met Ser Lys Asn Pro Val Pro Val Lys Lys Glu Ala Lys Leu Ser Glu
1 5 10 15

Ala Glu Leu His Asp Lys Ile Lys Asn Leu Glu Glu Glu Lys Ala Glu
20 25 30

Leu Phe Glu Lys Leu Asp Glu Glu His Pro Asp Val Val Ala Ala Arg
35 40 45

Glu Ser Val Leu Asn Asn Val Arg Val Pro Gly Thr Leu Trp Leu Arg
50 55 60

Gln Lys Glu Glu Asn Asp Lys Leu Lys Leu Glu Lys Lys Gly Leu Glu
65 70 75 80

Thr Glu Leu Gln Glu Lys Glu Gln Ala Ser Glu Glu Ala Ser Asn Asn
85 90 95

Gly Gln Leu Thr Leu Gln His Lys Asn Asn Ala Leu Thr Ser Glu Asn
100 105 110

Glu Ser Leu Arg Arg Glu Lys Asp Arg Tyr Leu Tyr Glu Lys Glu Glu
115 120 125

Leu Glu Gly Ser Glu Ser Ser Asn Asn Ala Glu Ser Ser Asn Ile Ser
130 135 140

Gln Glu Ser Lys Leu Ile Asn Thr Leu Thr Asp Glu Asn Glu Lys Leu
145 150 155 160

Arg Glu Glu Leu Gln Gln Tyr Tyr Ala Leu Ser Asp Ala Lys Glu Glu
165 170 175

Glu Pro Arg Tyr Lys Ala Leu Gln Thr Glu Val Lys Ala Ala Gly Gln
180 185 190

Ser Ala Pro Lys Gly Thr Asn Val Ser Ala Asp Leu Tyr Asn Ser Leu
195 200 205

Trp Asp Glu Asn Lys Thr Leu Arg Glu Lys Gln Glu Glu Tyr Ile Thr
210 215 220

Lys Ile Gln Asn Glu Glu Thr Lys Asn Lys Gly Thr Glu Gln Ala Lys
225 230 235 240

Asn Asn Asn Gly Glu Leu Thr Leu Gln Gln Lys Tyr Asp Ala Leu Thr
245 250 255

Asn Glu Asn Lys Ser Leu Arg Arg Glu Arg Asp Asn Tyr Leu Asn Tyr
260 265 270

Leu Tyr Glu Lys Pro Trp Glu Glu His Glu Lys Val Thr Gln Ala Arg
275 280 285

Glu Ala Val Ile Arg Glu Met Gln Gln Arg Gly Thr Asn Phe Gly Pro
290 295 300

Leu Leu Ala Ser Thr Met Arg Asp Asn His Asn Leu Lys Glu Thr Leu
305 310 315 320

Asp Lys Thr His Val Ser Lys Asn Pro Val Pro Val Lys Lys Glu Ala
325 330 335

Lys Leu Ser Glu Ala Glu Leu His Asp Lys Ile Lys Asn Leu Glu Glu
340 345 350

Glu Lys Ala Glu Leu Phe Glu Lys Leu Glu His His His His His His
355 360 365



<210> 5 
<211> 1209 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)..(1206) 

<220> 

<223> Description of Artificial Sequence: Nucleic acid 
encoding Hybrid of Group A Streptococci M protein 
and M-like protein 

<400> 5 

atg agt gac aat att aat cgt tct gtc tct gtc aaa gat aat gaa aaa 48
Met Ser Asp Asn Ile Asn Arg Ser Val Ser Val Lys Asp Asn Glu Lys
1 5 10 15

gaa tta cat aac aaa att gca gac ctt gaa gag gaa agg ggt gaa cat 96
Glu Leu His Asn Lys Ile Ala Asp Leu Glu Glu Glu Arg Gly Glu His
20 25 30

cta gac aaa ata gat gaa cta aaa gaa gaa cta aaa gca aag gaa aaa 144
Leu Asp Lys Ile Asp Glu Leu Lys Glu Glu Leu Lys Ala Lys Glu Lys
35 40 45

agt tca gga tcc gct gat cac cct agc tat acc gct gct aaa gat gaa 192
Ser Ser Gly Ser Ala Asp His Pro Ser Tyr Thr Ala Ala Lys Asp Glu
50 55 60

gta cta agt aag ttc tct gta ccg ggt cat gtt tgg gca cat gaa aga 240
Val Leu Ser Lys Phe Ser Val Pro Gly His Val Trp Ala His Glu Arg
65 70 75 80

gaa aaa aat gac aaa ctt agc tcg gaa aat gaa ggg ctt aag gct ggt 288
Glu Lys Asn Asp Lys Leu Ser Ser Glu Asn Glu Gly Leu Lys Ala Gly
85 90 95

tta cag gaa aag gaa caa gct agc gaa ggg gtt tct gta ggt tca gat 336
Leu Gln Glu Lys Glu Gln Ala Ser Glu Gly Val Ser Val Gly Ser Asp
100 105 110

gca tca cta cat aac cgc att aca gac ctt gaa gag gaa aga gaa aaa 384
Ala Ser Leu His Asn Arg Ile Thr Asp Leu Glu Glu Glu Arg Glu Lys
115 120 125

tta tta aat aaa tta gat aaa gtt gaa gaa gag cat aaa aaa gat cat 432
Leu Leu Asn Lys Leu Asp Lys Val Glu Glu Glu His Lys Lys Asp His
130 135 140

gaa caa ggt acc aac agt aag aac cct gcc cct gcc cct gcc tct gct 480
Glu Gln Gly Thr Asn Ser Lys Asn Pro Ala Pro Ala Pro Ala Ser Ala
145 150 155 160

gtc cct gtc aaa aaa gaa gca aca aaa tta agt gaa gca gaa tta tat 528
Val Pro Val Lys Lys Glu Ala Thr Lys Leu Ser Glu Ala Glu Leu Tyr
165 170 175

aac aaa att caa gaa ctt gaa gag gga aaa gca gaa tta ttc gga tcc 576
Asn Lys Ile Gln Glu Leu Glu Glu Gly Lys Ala Glu Leu Phe Gly Ser
180 185 190

gaa gaa gaa cgt act ttt act gag tta cca tat gaa gca cga tac aaa 624
Glu Glu Glu Arg Thr Phe Thr Glu Leu Pro Tyr Glu Ala Arg Tyr Lys
195 200 205

gca tgg aaa agt gaa aat gat gag ctt cgg gaa aat tat cgt cgt acc 672
Ala Trp Lys Ser Glu Asn Asp Glu Leu Arg Glu Asn Tyr Arg Arg Thr
210 215 220

tta gat aag ttt aat act gag caa ggt aag act acg cgc tta gaa gaa 720
Leu Asp Lys Phe Asn Thr Glu Gln Gly Lys Thr Thr Arg Leu Glu Glu
225 230 235 240

caa aat aag ctt gcg gac gcg aac tcg aaa agc gtt tct aat agt aac 768
Gln Asn Lys Leu Ala Asp Ala Asn Ser Lys Ser Val Ser Asn Ser Asn
245 250 255

gtg agc ata aat cta tat aat gag cta cag gct gaa cat gat aag cta 816
Val Ser Ile Asn Leu Tyr Asn Glu Leu Gln Ala Glu His Asp Lys Leu
260 265 270

cag act aaa cat gag gag cta ttg gct gaa cat gat gct ctt aaa gaa 864
Gln Thr Lys His Glu Glu Leu Leu Ala Glu His Asp Ala Leu Lys Glu
275 280 285

aaa caa gat aaa aat caa gaa ttc gat gac cgg agc gtt tct act aat 912
Lys Gln Asp Lys Asn Gln Glu Phe Asp Asp Arg Ser Val Ser Thr Asn
290 295 300

agt ggt agc gtg agc aca cca tat aat aac cta ttg aat gaa tat gat 960
Ser Gly Ser Val Ser Thr Pro Tyr Asn Asn Leu Leu Asn Glu Tyr Asp
305 310 315 320

gac cta ttg gct aaa cat ggt gag cta ttg agt gaa tat gat gct ctt 1008
Asp Leu Leu Ala Lys His Gly Glu Leu Leu Ser Glu Tyr Asp Ala Leu
325 330 335

aaa gaa aaa caa gat aaa aat caa gaa gct agc agt gac aat att aat 1056
Lys Glu Lys Gln Asp Lys Asn Gln Glu Ala Ser Ser Asp Asn Ile Asn
340 345 350

cgt tct gtc tct gtc aaa gat aat gaa aaa gaa tta cat aac aaa att 1104
Arg Ser Val Ser Val Lys Asp Asn Glu Lys Glu Leu His Asn Lys Ile
355 360 365

gca gac ctt gaa gag gaa agg ggt gaa cat cta gac aaa ata gat gaa 1152
Ala Asp Leu Glu Glu Glu Arg Gly Glu His Leu Asp Lys Ile Asp Glu
370 375 380

cta aaa gaa gaa cta aaa gca aag gaa aaa agt tca cac cac cac cac 1200
Leu Lys Glu Glu Leu Lys Ala Lys Glu Lys Ser Ser His His His His
385 390 395 400

cac cac tga 1209
His His



<210> 6 
<211> 402 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hybrid of Group A Streptococci M 
protein and M-like protein 

<400> 6 

Met Ser Asp Asn Ile Asn Arg Ser Val Ser Val Lys Asp Asn Glu Lys
1 5 10 15

Glu Leu His Asn Lys Ile Ala Asp Leu Glu Glu Glu Arg Gly Glu His
20 25 30

Leu Asp Lys Ile Asp Glu Leu Lys Glu Glu Leu Lys Ala Lys Glu Lys
35 40 45

Ser Ser Gly Ser Ala Asp His Pro Ser Tyr Thr Ala Ala Lys Asp Glu
50 55 60

Val Leu Ser Lys Phe Ser Val Pro Gly His Val Trp Ala His Glu Arg
65 70 75 80

Glu Lys Asn Asp Lys Leu Ser Ser Glu Asn Glu Gly Leu Lys Ala Gly
85 90 95

Leu Gln Glu Lys Glu Gln Ala Ser Glu Gly Val Ser Val Gly Ser Asp
100 105 110

Ala Ser Leu His Asn Arg Ile Thr Asp Leu Glu Glu Glu Arg Glu Lys
115 120 125

Leu Leu Asn Lys Leu Asp Lys Val Glu Glu Glu His Lys Lys Asp His
130 135 140

Glu Gln Gly Thr Asn Ser Lys Asn Pro Ala Pro Ala Pro Ala Ser Ala
145 150 155 160

Val Pro Val Lys Lys Glu Ala Thr Lys Leu Ser Glu Ala Glu Leu Tyr
165 170 175

Asn Lys Ile Gln Glu Leu Glu Glu Gly Lys Ala Glu Leu Phe Gly Ser
180 185 190

Glu Glu Glu Arg Thr Phe Thr Glu Leu Pro Tyr Glu Ala Arg Tyr Lys
195 200 205

Ala Trp Lys Ser Glu Asn Asp Glu Leu Arg Glu Asn Tyr Arg Arg Thr
210 215 220

Leu Asp Lys Phe Asn Thr Glu Gln Gly Lys Thr Thr Arg Leu Glu Glu
225 230 235 240

Gln Asn Lys Leu Ala Asp Ala Asn Ser Lys Ser Val Ser Asn Ser Asn
245 250 255

Val Ser Ile Asn Leu Tyr Asn Glu Leu Gln Ala Glu His Asp Lys Leu
260 265 270

Gln Thr Lys His Glu Glu Leu Leu Ala Glu His Asp Ala Leu Lys Glu
275 280 285

Lys Gln Asp Lys Asn Gln Glu Phe Asp Asp Arg Ser Val Ser Thr Asn
290 295 300

Ser Gly Ser Val Ser Thr Pro Tyr Asn Asn Leu Leu Asn Glu Tyr Asp
305 310 315 320

Asp Leu Leu Ala Lys His Gly Glu Leu Leu Ser Glu Tyr Asp Ala Leu
325 330 335

Lys Glu Lys Gln Asp Lys Asn Gln Glu Ala Ser Ser Asp Asn Ile Asn
340 345 350

Arg Ser Val Ser Val Lys Asp Asn Glu Lys Glu Leu His Asn Lys Ile
355 360 365

Ala Asp Leu Glu Glu Glu Arg Gly Glu His Leu Asp Lys Ile Asp Glu
370 375 380

Leu Lys Glu Glu Leu Lys Ala Lys Glu Lys Ser Ser His His His His
385 390 395 400

His His



<210> 7 
<211> 1287
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)..(1284)

<220> 

<223> Description of Artificial Sequence: Hybrid of 

Group A Streptococci M protein and M-like protein 

<400> 7 

atg aac ggt gat ggt aat cct agg gaa gtt ata gaa gat ctt gca gca 48
Met Asn Gly Asp Gly Asn Pro Arg Glu Val Ile Glu Asp Leu Ala Ala
1 5 10 15

aac aat ccc gca ata caa aat ata cgt tta cgt cac gaa aac aag gac 96
Asn Asn Pro Ala Ile Gln Asn Ile Arg Leu Arg His Glu Asn Lys Asp
20 25 30

tta aaa gcg aga tta gag aat gca atg gaa gtt gca gga aga gat ttt 144
Leu Lys Ala Arg Leu Glu Asn Ala Met Glu Val Ala Gly Arg Asp Phe
35 40 45

aag aga gct ctc gac gat cat agt gat tta gtc gca gaa aaa caa cgt 192
Lys Arg Ala Leu Asp Asp His Ser Asp Leu Val Ala Glu Lys Gln Arg
50 55 60

tta gaa gat tta gga caa aaa ttt gaa aga ctg aaa cag cgt tca gaa 240
Leu Glu Asp Leu Gly Gln Lys Phe Glu Arg Leu Lys Gln Arg Ser Glu
65 70 75 80

ctc tac ctt cag caa tac tat gat aat aaa tca aat gga tat aaa ggt 288
Leu Tyr Leu Gln Gln Tyr Tyr Asp Asn Lys Ser Asn Gly Tyr Lys Gly
85 90 95

gac tgg tat gta caa cag tta gga tcc gat tca gta agt gga tta gag 336
Asp Trp Tyr Val Gln Gln Leu Gly Ser Asp Ser Val Ser Gly Leu Glu
100 105 110

gtg gca gac ccc tct gat agt aag aaa ctt att gaa tta ggt ttg gct 384
Val Ala Asp Pro Ser Asp Ser Lys Lys Leu Ile Glu Leu Gly Leu Ala
115 120 125

aaa tac ctt aat gat aaa tta ccc ttt aaa act aaa gaa gat tca gag 432
Lys Tyr Leu Asn Asp Lys Leu Pro Phe Lys Thr Lys Glu Asp Ser Glu
130 135 140

att tta tca gag tta cgt gat gta tta aaa aat ctg cag gag tct cca 480
Ile Leu Ser Glu Leu Arg Asp Val Leu Lys Asn Leu Gln Glu Ser Pro
145 150 155 160

aaa agt act gag act tct gct aat gga gct gat aaa tta gct gat gca 528
Lys Ser Thr Glu Thr Ser Ala Asn Gly Ala Asp Lys Leu Ala Asp Ala
165 170 175

tac aac aca ttg ctt act gaa cat gag aaa ctc aga gat gag tat tat 576
Tyr Asn Thr Leu Leu Thr Glu His Glu Lys Leu Arg Asp Glu Tyr Tyr
180 185 190

aca tta att gat gct aaa gaa gaa gaa cct cgc tat aaa gca ttg ggt 624
Thr Leu Ile Asp Ala Lys Glu Glu Glu Pro Arg Tyr Lys Ala Leu Gly
195 200 205

acc ttg tta gat cag gtt aca caa tta tat act aaa cat aat agt aat 672
Thr Leu Leu Asp Gln Val Thr Gln Leu Tyr Thr Lys His Asn Ser Asn
210 215 220

tac caa caa tat aat gca caa gct ggc aga ctt gac ctg aga caa aag 720
Tyr Gln Gln Tyr Asn Ala Gln Ala Gly Arg Leu Asp Leu Arg Gln Lys
225 230 235 240

gct gaa tat cta aaa ggc ctt aat gat tgg gct gag cgc ctg tta caa 768
Ala Glu Tyr Leu Lys Gly Leu Asn Asp Trp Ala Glu Arg Leu Leu Gln
245 250 255

gag tta aat ggt acc aac aat gat ggt cgt tct cgt gac gtt acg gaa 816
Glu Leu Asn Gly Thr Asn Asn Asp Gly Arg Ser Arg Asp Val Thr Glu
260 265 270

gag att gca gca aac aat acc aca gta caa aat ata cgt tta cgt aac 864
Glu Ile Ala Ala Asn Asn Thr Thr Val Gln Asn Ile Arg Leu Arg Asn
275 280 285

gaa aac aag aac tta aaa gcg aaa aac gag gac tta gaa gcg aga tta 912
Glu Asn Lys Asn Leu Lys Ala Lys Asn Glu Asp Leu Glu Ala Arg Leu
290 295 300

gag aat gca atg aat gtt gca gga cgc gat ttt aag cgt gct gaa ttc 960
Glu Asn Ala Met Asn Val Ala Gly Arg Asp Phe Lys Arg Ala Glu Phe
305 310 315 320

gca cct ctt act cgt gct aca gca gac aat aaa gac gaa tta ata aaa 1008
Ala Pro Leu Thr Arg Ala Thr Ala Asp Asn Lys Asp Glu Leu Ile Lys
325 330 335

aga gct aac ggt tat gag ata cag aac cat cag tta aca gtt gag aat 1056
Arg Ala Asn Gly Tyr Glu Ile Gln Asn His Gln Leu Thr Val Glu Asn
340 345 350

aaa aaa tta aaa act gat aag gaa cag tta aca aaa gag aat gat gat 1104
Lys Lys Leu Lys Thr Asp Lys Glu Gln Leu Thr Lys Glu Asn Asp Asp
355 360 365

tta aaa cac gtg aac ggt gat ggt aat cct cgt gaa gtt ata gaa gat 1152
Leu Lys His Val Asn Gly Asp Gly Asn Pro Arg Glu Val Ile Glu Asp
370 375 380

ctt gca gca aac aat ccc gca ata caa aat ata cgt tta cgt cac gaa 1200
Leu Ala Ala Asn Asn Pro Ala Ile Gln Asn Ile Arg Leu Arg His Glu
385 390 395 400

aac aag gac tta aaa gcg aga tta gag aat gca atg gaa gtt gca gga 1248
Asn Lys Asp Leu Lys Ala Arg Leu Glu Asn Ala Met Glu Val Ala Gly
405 410 415

cgt gat ttt aag cgt gct cac cac cac cac cac cac taa 1287
Arg Asp Phe Lys Arg Ala His His His His His His
420 425



<210> 8 
<211> 428 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hybrid of Group A Streptococci M 
protein and M-like protein 

<400> 8 

Met Asn Gly Asp Gly Asn Pro Arg Glu Val Ile Glu Asp Leu Ala Ala
1 5 10 15

Asn Asn Pro Ala Ile Gln Asn Ile Arg Leu Arg His Glu Asn Lys Asp
20 25 30

Leu Lys Ala Arg Leu Glu Asn Ala Met Glu Val Ala Gly Arg Asp Phe
35 40 45

Lys Arg Ala Leu Asp Asp His Ser Asp Leu Val Ala Glu Lys Gln Arg
50 55 60

Leu Glu Asp Leu Gly Gln Lys Phe Glu Arg Leu Lys Gln Arg Ser Glu
65 70 75 80

Leu Tyr Leu Gln Gln Tyr Tyr Asp Asn Lys Ser Asn Gly Tyr Lys Gly
85 90 95

Asp Trp Tyr Val Gln Gln Leu Gly Ser Asp Ser Val Ser Gly Leu Glu
100 105 110

Val Ala Asp Pro Ser Asp Ser Lys Lys Leu Ile Glu Leu Gly Leu Ala
115 120 125

Lys Tyr Leu Asn Asp Lys Leu Pro Phe Lys Thr Lys Glu Asp Ser Glu
130 135 140

Ile Leu Ser Glu Leu Arg Asp Val Leu Lys Asn Leu Gln Glu Ser Pro
145 150 155 160

Lys Ser Thr Glu Thr Ser Ala Asn Gly Ala Asp Lys Leu Ala Asp Ala
165 170 175

Tyr Asn Thr Leu Leu Thr Glu His Glu Lys Leu Arg Asp Glu Tyr Tyr
180 185 190

Thr Leu Ile Asp Ala Lys Glu Glu Glu Pro Arg Tyr Lys Ala Leu Gly
195 200 205

Thr Leu Leu Asp Gln Val Thr Gln Leu Tyr Thr Lys His Asn Ser Asn
210 215 220

Tyr Gln Gln Tyr Asn Ala Gln Ala Gly Arg Leu Asp Leu Arg Gln Lys
225 230 235 240

Ala Glu Tyr Leu Lys Gly Leu Asn Asp Trp Ala Glu Arg Leu Leu Gln
245 250 255

Glu Leu Asn Gly Thr Asn Asn Asp Gly Arg Ser Arg Asp Val Thr Glu
260 265 270

Glu Ile Ala Ala Asn Asn Thr Thr Val Gln Asn Ile Arg Leu Arg Asn
275 280 285

Glu Asn Lys Asn Leu Lys Ala Lys Asn Glu Asp Leu Glu Ala Arg Leu
290 295 300

Glu Asn Ala Met Asn Val Ala Gly Arg Asp Phe Lys Arg Ala Glu Phe
305 310 315 320

Ala Pro Leu Thr Arg Ala Thr Ala Asp Asn Lys Asp Glu Leu Ile Lys
325 330 335

Arg Ala Asn Gly Tyr Glu Ile Gln Asn His Gln Leu Thr Val Glu Asn
340 345 350

Lys Lys Leu Lys Thr Asp Lys Glu Gln Leu Thr Lys Glu Asn Asp Asp
355 360 365

Leu Lys His Val Asn Gly Asp Gly Asn Pro Arg Glu Val Ile Glu Asp
370 375 380

Leu Ala Ala Asn Asn Pro Ala Ile Gln Asn Ile Arg Leu Arg His Glu
385 390 395 400

Asn Lys Asp Leu Lys Ala Arg Leu Glu Asn Ala Met Glu Val Ala Gly
405 410 415

Arg Asp Phe Lys Arg Ala His His His His His His
420 425
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<210> 9 
<211> 1008 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> <1) . . (1005) 

<220> 

<223> Description of Artificial Sequence: Nucleic acid 
encoding Hybrid of Group A Streptococci M protein 
and M-like protein 

<400> 9 

atg gtc gcg act agg tct cag aca gat act ctg gaa aaa gta caa gaa 48
Met Val Ala Thr Arg Ser Gln Thr Asp Thr Leu Glu Lys Val Gln Glu
1 5 10 15

cgt gct gac aag ttt gag ata gaa aac aat acg tta aaa ctt aag aat 96
Arg Ala Asp Lys Phe Glu Ile Glu Asn Asn Thr Leu Lys Leu Lys Asn
20 25 30

agt gac tta agt ttt aat aat aaa gcg tta aaa gat cat aat gat gag 144
Ser Asp Leu Ser Phe Asn Asn Lys Ala Leu Lys Asp His Asn Asp Glu
35 40 45

tta act gaa gag ttg agt aat gct aaa gag aaa cta cgt cac gtg gcc 192
Leu Thr Glu Glu Leu Ser Asn Ala Lys Glu Lys Leu Arg His Val Ala
50 55 60

gtg act agg ggt aca ata aat gac ccg caa aga gca aaa gaa gct ctt 240
Val Thr Arg Gly Thr Ile Asn Asp Pro Gln Arg Ala Lys Glu Ala Leu
65 70 75 80

gac aag tat gag cta gaa aac cat gac tta aaa act aag gga tcc aga 288
Asp Lys Tyr Glu Leu Glu Asn His Asp Leu Lys Thr Lys Gly Ser Arg
85 90 95

gtg ttt cct agg ggg acg gta gaa aac ccg gac aaa gca cga gaa ctt 336
Val Phe Pro Arg Gly Thr Val Glu Asn Pro Asp Lys Ala Arg Glu Leu
100 105 110

ctt aac aag tat gac gta gag aac tct atg tta caa gct aat aat gac 384
Leu Asn Lys Tyr Asp Val Glu Asn Ser Met Leu Gln Ala Asn Asn Asp
115 120 125

aag tta cca tgg aga gtg cgt tat act agg cat acg cca gaa gat aag 432
Lys Leu Pro Trp Arg Val Arg Tyr Thr Arg His Thr Pro Glu Asp Lys
130 135 140

cta aaa aaa att att gac gat ctt gac gca aaa gaa cat gaa tta caa 480
Leu Lys Lys Ile Ile Asp Asp Leu Asp Ala Lys Glu His Glu Leu Gln
145 150 155 160

caa cag aat gag aag tta tct ctg cag aaa gtg tat att act agg ggt 528
Gln Gln Asn Glu Lys Leu Ser Leu Gln Lys Val Tyr Ile Thr Arg Gly
165 170 175






atg acd aaa gag gac gta gaa aaa att gct aac aac ctt gac ata gaa 576
Met Thr Lys Glu Asp Val Glu Lys Ile Ala Asn Asn Leu Asp Ile Glu
180 185 190

aac cat ggg tta aaa caa cag aat gaa cag tta tct act gat aaa caa 624
Asn His Gly Leu Lys Gln Gln Asn Glu Gln Leu Ser Thr Asp Lys Gln
195 200 205

ggt ctt gaa gaa cag aat ggt acc gat aga gtt agt agg tct atg tca 672
Gly Leu Glu Glu Gln Asn Gly Thr Asp Arg Val Ser Arg Ser Met Ser
210 215 220

aga gat gat cta tta aac agg gct cag gat ctt gaa gca aaa aac cac 720
Arg Asp Asp Leu Leu Asn Arg Ala Gln Asp Leu Glu Ala Lys Asn His
225 230 235 240

ggg tta gaa cac cag aat act aag tta tct act gaa aat aaa acg ctt 768
Gly Leu Glu His Gln Asn Thr Lys Leu Ser Thr Glu Asn Lys Thr Leu
245 250 255

caa gaa caa gca gaa gca cgc cag aaa gaa atc gat gtc gcg act agg 816
Gln Glu Gln Ala Glu Ala Arg Gln Lys Glu Ile Asp Val Ala Thr Arg
260 265 270

tct cag aca gat act ctg gaa aaa gta caa gaa cgt gct gac aag ttt 864
Ser Gln Thr Asp Thr Leu Glu Lys Val Gln Glu Arg Ala Asp Lys Phe
275 280 285

gag ata gaa aac aat acg tta aaa ctt aag aat agt gac tta agt ttt 912
Glu Ile Glu Asn Asn Thr Leu Lys Leu Lys Asn Ser Asp Leu Ser Phe
290 295 300

aat aat aaa gcg tta aaa gat cat aat gat gag tta act gaa gag ttg 960
Asn Asn Lys Ala Leu Lys Asp His Asn Asp Glu Leu Thr Glu Glu Leu
305 310 315 320

agt aat gct aaa gag aaa cta cgt cac cac cac cac cac cac tgc tga 1008
Ser Asn Ala Lys Glu Lys Leu Arg His His His His His His Cys
325 330 335


<210> 10 
<211> 335 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hybrid of Group A Streptococci M 
protein and M-like protein 

<400> 10 

Met Val Ala Thr Arg Ser Gln Thr Asp Thr Leu Glu Lys Val Gln Glu
1 5 10 15

Arg Ala Asp Lys Phe Glu Ile Glu Asn Asn Thr Leu Lys Leu Lys Asn
20 25 30

Ser Asp Leu Ser Phe Asn Asn Lys Ala Leu Lys Asp His Asn Asp Glu
35 40 45

Leu Thr Glu Glu Leu Ser Asn Ala Lys Glu Lys Leu Arg His Val Ala
50 55 60

Val Thr Arg Gly Thr Ile Asn Asp Pro Gln Arg Ala Lys Glu Ala Leu
65 70 75 80

Asp Lys Tyr Glu Leu Glu Asn His Asp Leu Lys Thr Lys Gly Ser Arg
85 90 95

Val Phe Pro Arg Gly Thr Val Glu Asn Pro Asp Lys Ala Arg Glu Leu
100 105 110

Leu Asn Lys Tyr Asp Val Glu Asn Ser Met Leu Gln Ala Asn Asn Asp
115 120 125

Lys Leu Pro Trp Arg Val Arg Tyr Thr Arg His Thr Pro Glu Asp Lys
130 135 140

Leu Lys Lys Ile Ile Asp Asp Leu Asp Ala Lys Glu His Glu Leu Gln
145 150 155 160

Gln Gln Asn Glu Lys Leu Ser Leu Gln Lys Val Tyr Ile Thr Arg Gly
165 170 175

Met Thr Lys Glu Asp Val Glu Lys Ile Ala Asn Asn Leu Asp Ile Glu
180 185 190

Asn His Gly Leu Lys Gln Gln Asn Glu Gln Leu Ser Thr Asp Lys Gln
195 200 205

Gly Leu Glu Glu Gln Asn Gly Thr Asp Arg Val Ser Arg Ser Met Ser
210 215 220

Arg Asp Asp Leu Leu Asn Arg Ala Gln Asp Leu Glu Ala Lys Asn His
225 230 235 240

Gly Leu Glu His Gln Asn Thr Lys Leu Ser Thr Glu Asn Lys Thr Leu
245 250 255

Gln Glu Gln Ala Glu Ala Arg Gln Lys Glu Ile Asp Val Ala Thr Arg
260 265 270

Ser Gln Thr Asp Thr Leu Glu Lys Val Gln Glu Arg Ala Asp Lys Phe
275 280 285

Glu Ile Glu Asn Asn Thr Leu Lys Leu Lys Asn Ser Asp Leu Ser Phe
290 295 300

Asn Asn Lys Ala Leu Lys Asp His Asn Asp Glu Leu Thr Glu Glu Leu
305 310 315 320

Ser Asn Ala Lys Glu Lys Leu Arg His His His His His His Cys
325 330 335



<210> 11
<211> 1194
<212> DNA

<213> Artificial Sequence

<220>

<221> CDS

<222> (1)..(1191)

<220>

<223> Description of Artificial Sequence: Hybrid of
Group A Streptococci M protein and M-like protein

<400> 11

atg agt aag aac cct gtc cct gtc aaa aaa gaa gca aaa tta agt gaa 48
Met Ser Lys Asn Pro Val Pro Val Lys Lys Glu Ala Lys Leu Ser Glu
1 5 10 15

gca gaa tta cat gac aaa att aaa aac ctt gaa gag gaa aaa gca gaa 96
Ala Glu Leu His Asp Lys Ile Lys Asn Leu Glu Glu Glu Lys Ala Glu
20 25 30

tta ttc gag aaa tta gat aaa gtt gaa gaa gag cat aaa aaa gtt gaa 144
Leu Phe Glu Lys Leu Asp Lys Val Glu Glu Glu His Lys Lys Val Glu
35 40 45

gaa gag cat gtc gac gaa gaa cac cct gac gtt gtc gct gct aga gaa 192
Glu Glu His Val Asp Glu Glu His Pro Asp Val Val Ala Ala Arg Glu
50 55 60

agc gta cta aat aat gtc cgt gta ccg ggt aca ctt tgg cta cgt caa 240
Ser Val Leu Asn Asn Val Arg Val Pro Gly Thr Leu Trp Leu Arg Gln
65 70 75 80

aaa gaa gaa aat gac aaa ctt aaa ttg gaa aag aaa ggg ctt gag act 288
Lys Glu Glu Asn Asp Lys Leu Lys Leu Glu Lys Lys Gly Leu Glu Thr
85 90 95

gag tta cag gaa aag gaa caa gct agc gaa gaa gca tca aat aat ggg 336
Glu Leu Gln Glu Lys Glu Gln Ala Ser Glu Glu Ala Ser Asn Asn Gly
100 105 110

caa ctc aca tta cag cat aaa aat aat gca ttg act agt gag aat gag 384
Gln Leu Thr Leu Gln His Lys Asn Asn Ala Leu Thr Ser Glu Asn Glu
115 120 125

tct ctt aga aga gaa aaa gat aga tat ttg tat gaa aaa gaa gaa tta 432
Ser Leu Arg Arg Glu Lys Asp Arg Tyr Leu Tyr Glu Lys Glu Glu Leu
130 135 140

gaa gga tcc gag tca tca aat aat gcg gag tca tca aac att tct caa 480
Glu Gly Ser Glu Ser Ser Asn Asn Ala Glu Ser Ser Asn Ile Ser Gln
145 150 155 160

gaa agc aaa cta ata aat aca ttg act gat gaa aat gag aaa ctc aga 528
Glu Ser Lys Leu Ile Asn Thr Leu Thr Asp Glu Asn Glu Lys Leu Arg
165 170 175

gaa gag ctc caa cag tat tat gca tta agt gat gct aaa gaa gaa gaa 576
Glu Glu Leu Gln Gln Tyr Tyr Ala Leu Ser Asp Ala Lys Glu Glu Glu
180 185 190

cct agg tat aaa gca ctg cag act gaa gtt aag gct gcg ggg caa agc 624
Pro Arg Tyr Lys Ala Leu Gln Thr Glu Val Lys Ala Ala Gly Gln Ser
195 200 205

gct cct aaa ggt aca aac gtg agc gca gac cta tat aat tcg cta tgg 672
Ala Pro Lys Gly Thr Asn Val Ser Ala Asp Leu Tyr Asn Ser Leu Trp
210 215 220

gat gaa aat aaa act ctt aga gaa aaa caa gaa gag tat ata aca aaa 720
Asp Glu Asn Lys Thr Leu Arg Glu Lys Gln Glu Glu Tyr Ile Thr Lys
225 230 235 240

att caa aat gaa gag aca aaa aat aaa ggt acc gaa caa gca aaa aat 768
Ile Gln Asn Glu Glu Thr Lys Asn Lys Gly Thr Glu Gln Ala Lys Asn
245 250 255

aat aat ggg gaa ctc aca tta cag caa aaa tac gat gca ttg act aat 816
Asn Asn Gly Glu Leu Thr Leu Gln Gln Lys Tyr Asp Ala Leu Thr Asn
260 265 270

gag aat aag tct ctt aga aga gag aga gat aac tat tta aat tat tta 864
Glu Asn Lys Ser Leu Arg Arg Glu Arg Asp Asn Tyr Leu Asn Tyr Leu
275 280 285

tat gaa aaa cca tgg gaa gag cat gaa aaa gta aca caa gcc aga gaa 912
Tyr Glu Lys Pro Trp Glu Glu His Glu Lys Val Thr Gln Ala Arg Glu
290 295 300

gcg gtt atc aga gag atg caa cag agg ggg aca aat ttt gga cct ctg 960
Ala Val Ile Arg Glu Met Gln Gln Arg Gly Thr Asn Phe Gly Pro Leu
305 310 315 320

tta gca agt aca atg cga gat aat cac aat tta aaa gaa acg ctt gac 1008
Leu Ala Ser Thr Met Arg Asp Asn His Asn Leu Lys Glu Thr Leu Asp
325 330 335

aaa act cac gtg agt aag aac cct gtc cct gtc aaa aaa gaa gca aaa 1056
Lys Thr His Val Ser Lys Asn Pro Val Pro Val Lys Lys Glu Ala Lys
340 345 350

tta agt gaa gca gaa tta cat gac aaa att aaa aac ctt gaa gag gaa 1104
Leu Ser Glu Ala Glu Leu His Asp Lys Ile Lys Asn Leu Glu Glu Glu
355 360 365

aaa gca gaa tta ttc gag aaa tta gat aaa gtt gaa gaa gag cat aaa 1152
Lys Ala Glu Leu Phe Glu Lys Leu Asp Lys Val Glu Glu Glu His Lys
370 375 380

aaa gtt gaa gaa gag cat cac cac cac cac cac cac tgc taa 1194
Lys Val Glu Glu Glu His His His His His His His Cys
385 390 395



<210> 12
<211> 397
<212> PRT

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Hybrid of Group A Streptococci M
protein and M-like protein

<400> 12

Met Ser Lys Asn Pro Val Pro Val Lys Lys Glu Ala Lys Leu Ser Glu
1 5 10 15

Ala Glu Leu His Asp Lys Ile Lys Asn Leu Glu Glu Glu Lys Ala Glu
20 25 30

Leu Phe Glu Lys Leu Asp Lys Val Glu Glu Glu His Lys Lys Val Glu
35 40 45

Glu Glu His Val Asp Glu Glu His Pro Asp Val Val Ala Ala Arg Glu
50 55 60

Ser Val Leu Asn Asn Val Arg Val Pro Gly Thr Leu Trp Leu Arg Gln
65 70 75 80

Lys Glu Glu Asn Asp Lys Leu Lys Leu Glu Lys Lys Gly Leu Glu Thr
85 90 95

Glu Leu Gln Glu Lys Glu Gln Ala Ser Glu Glu Ala Ser Asn Asn Gly
100 105 110

Gln Leu Thr Leu Gln His Lys Asn Asn Ala Leu Thr Ser Glu Asn Glu
115 120 125

Ser Leu Arg Arg Glu Lys Asp Arg Tyr Leu Tyr Glu Lys Glu Glu Leu
130 135 140

Glu Gly Ser Glu Ser Ser Asn Asn Ala Glu Ser Ser Asn Ile Ser Gln
145 150 155 160

Glu Ser Lys Leu Ile Asn Thr Leu Thr Asp Glu Asn Glu Lys Leu Arg
165 170 175

Glu Glu Leu Gln Gln Tyr Tyr Ala Leu Ser Asp Ala Lys Glu Glu Glu
180 185 190

Pro Arg Tyr Lys Ala Leu Gln Thr Glu Val Lys Ala Ala Gly Gln Ser
195 200 205

Ala Pro Lys Gly Thr Asn Val Ser Ala Asp Leu Tyr Asn Ser Leu Trp
210 215 220

Asp Glu Asn Lys Thr Leu Arg Glu Lys Gln Glu Glu Tyr Ile Thr Lys
225 230 235 240

Ile Gln Asn Glu Glu Thr Lys Asn Lys Gly Thr Glu Gln Ala Lys Asn
245 250 255

Asn Asn Gly Glu Leu Thr Leu Gln Gln Lys Tyr Asp Ala Leu Thr Asn
260 265 270

Glu Asn Lys Ser Leu Arg Arg Glu Arg Asp Asn Tyr Leu Asn Tyr Leu
275 280 285

Tyr Glu Lys Pro Trp Glu Glu His Glu Lys Val Thr Gln Ala Arg Glu
290 295 300

Ala Val Ile Arg Glu Met Gln Gln Arg Gly Thr Asn Phe Gly Pro Leu
305 310 315 320

Leu Ala Ser Thr Met Arg Asp Asn His Asn Leu Lys Glu Thr Leu Asp
325 330 335

Lys Thr His Val Ser Lys Asn Pro Val Pro Val Lys Lys Glu Ala Lys
340 345 350

Leu Ser Glu Ala Glu Leu His Asp Lys Ile Lys Asn Leu Glu Glu Glu
355 360 365

Lys Ala Glu Leu Phe Glu Lys Leu Asp Lys Val Glu Glu Glu His Lys
370 375 380

Lys Val Glu Glu Glu His His His His His His His Cys
385 390 395



<210> 13
<211> 1212
<212> DNA

<213> Artificial Sequence

<220>

<221> CDS

<222> (1)..(1209)

<220>

<223> Description of Artificial Sequence: Nucleic acid
encoding Hybrid of Group A Streptococci M protein
and M-like protein

<400> 13

atg agt gac aat att aat aga tct gtc tct gtc aaa gat aat gaa aaa 48
Met Ser Asp Asn Ile Asn Arg Ser Val Ser Val Lys Asp Asn Glu Lys
1 5 10 15

gaa tta cat aac aaa att gca gac ctt gaa gag gaa agg ggt gaa cat 96
Glu Leu His Asn Lys Ile Ala Asp Leu Glu Glu Glu Arg Gly Glu His
20 25 30

cta gac aaa ata gat gaa cta aaa gaa gaa cta aaa gca aag gaa aaa 144
Leu Asp Lys Ile Asp Glu Leu Lys Glu Glu Leu Lys Ala Lys Glu Lys
35 40 45

agt tca gga tcc gct gat cac cct agc tat acc gct gct aaa gat gaa 192
Ser Ser Gly Ser Ala Asp His Pro Ser Tyr Thr Ala Ala Lys Asp Glu
50 55 60

gta cta agt aag ttc tct gta ccg ggt cat gtt tgg gca cat gaa aga 240
Val Leu Ser Lys Phe Ser Val Pro Gly His Val Trp Ala His Glu Arg
65 70 75 80

gaa aaa aat gac aaa ctt agc tcg gaa aat gaa ggg ctt aag gct ggt 288
Glu Lys Asn Asp Lys Leu Ser Ser Glu Asn Glu Gly Leu Lys Ala Gly
85 90 95

tta cag gaa aag gaa caa gct agc gaa ggg gtt tct gta ggt tca gat 336
Leu Gln Glu Lys Glu Gln Ala Ser Glu Gly Val Ser Val Gly Ser Asp
100 105 110

gca tca cta cat aac cgc att aca gac ctt gaa gag gaa aga gaa aaa 384
Ala Ser Leu His Asn Arg Ile Thr Asp Leu Glu Glu Glu Arg Glu Lys
115 120 125

tta tta aat aaa tta gat aaa gtt gaa gaa gag cat aaa aaa gat cat 432
Leu Leu Asn Lys Leu Asp Lys Val Glu Glu Glu His Lys Lys Asp His
130 135 140

gaa caa ggt acc aac agt aag aac cct gcc cct gcc cct gcc tct gct 480
Glu Gln Gly Thr Asn Ser Lys Asn Pro Ala Pro Ala Pro Ala Ser Ala
145 150 155 160

gtc cct gtc aaa aaa gaa gca aca aaa tta agt gaa gca gaa tta tat 528
Val Pro Val Lys Lys Glu Ala Thr Lys Leu Ser Glu Ala Glu Leu Tyr
165 170 175

aac aaa att caa gaa ctt gaa gag gga aaa gca gaa tta ttc gga tcc 576
Asn Lys Ile Gln Glu Leu Glu Glu Gly Lys Ala Glu Leu Phe Gly Ser
180 185 190

gaa gaa gaa cgt act ttt act gag tta cca tat gaa gca cga tac aaa 624
Glu Glu Glu Arg Thr Phe Thr Glu Leu Pro Tyr Glu Ala Arg Tyr Lys
195 200 205

gca tgg aaa agt gaa aat gat gag ctt cgg gaa aat tat aga agg acc 672
Ala Trp Lys Ser Glu Asn Asp Glu Leu Arg Glu Asn Tyr Arg Arg Thr
210 215 220

tta gat aag ttt aat act gag caa ggt aag act acg aga tta gaa gaa 720
Leu Asp Lys Phe Asn Thr Glu Gln Gly Lys Thr Thr Arg Leu Glu Glu
225 230 235 240

caa aat aag ctt gcg gac gcg aac tcg aaa agc gtt tct aat agt aac 768
Gln Asn Lys Leu Ala Asp Ala Asn Ser Lys Ser Val Ser Asn Ser Asn
245 250 255

gtg agc ata aat cta tat aat gag cta cag gct gaa cat gat aag cta 816
Val Ser Ile Asn Leu Tyr Asn Glu Leu Gln Ala Glu His Asp Lys Leu
260 265 270

cag act aaa cat gag gag cta ttg gct gaa cat gat gct ctt aaa gaa 864
Gln Thr Lys His Glu Glu Leu Leu Ala Glu His Asp Ala Leu Lys Glu
275 280 285

aaa caa gat aaa aat caa gaa ttc gat gac cgg agc gtt tct act aat 912
Lys Gln Asp Lys Asn Gln Glu Phe Asp Asp Arg Ser Val Ser Thr Asn
290 295 300

agt ggt agc gtg agc aca cca tat aat aac cta ttg aat gaa tat gat 960
Ser Gly Ser Val Ser Thr Pro Tyr Asn Asn Leu Leu Asn Glu Tyr Asp
305 310 315 320

gac cta ttg gct aaa cat ggt gag cta ttg agt gaa tat gat gct ctt 1008
Asp Leu Leu Ala Lys His Gly Glu Leu Leu Ser Glu Tyr Asp Ala Leu
325 330 335

aaa gaa aaa caa gat aaa aat caa gaa gct agc agt gac aat att aat 1056
Lys Glu Lys Gln Asp Lys Asn Gln Glu Ala Ser Ser Asp Asn Ile Asn
340 345 350

aga tct gtc tct gtc aaa gat aat gaa aaa gaa tta cat aac aaa att 1104
Arg Ser Val Ser Val Lys Asp Asn Glu Lys Glu Leu His Asn Lys Ile
355 360 365

gca gac ctt gaa gag gaa agg ggt gaa cat cta gac aaa ata gat gaa 1152
Ala Asp Leu Glu Glu Glu Arg Gly Glu His Leu Asp Lys Ile Asp Glu
370 375 380

cta aaa gaa gaa cta aaa gca aag gaa aaa agt tca cac cac cac cac 1200
Leu Lys Glu Glu Leu Lys Ala Lys Glu Lys Ser Ser His His His His
385 390 395 400

cac cac tgc tga 1212
His His Cys



<210> 14 
<211> 403 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hybrid of Group A Streptococci M 
protein and M-like protein 

<400> 14 

Met Ser Asp Asn Ile Asn Arg Ser Val Ser Val Lys Asp Asn Glu Lys
1 5 10 15

Glu Leu His Asn Lys Ile Ala Asp Leu Glu Glu Glu Arg Gly Glu His
20 25 30

Leu Asp Lys Ile Asp Glu Leu Lys Glu Glu Leu Lys Ala Lys Glu Lys
35 40 45

Ser Ser Gly Ser Ala Asp His Pro Ser Tyr Thr Ala Ala Lys Asp Glu
50 55 60

Val Leu Ser Lys Phe Ser Val Pro Gly His Val Trp Ala His Glu Arg
65 70 75 80

Glu Lys Asn Asp Lys Leu Ser Ser Glu Asn Glu Gly Leu Lys Ala Gly
85 90 95

Leu Gln Glu Lys Glu Gln Ala Ser Glu Gly Val Ser Val Gly Ser Asp
100 105 110

Ala Ser Leu His Asn Arg Ile Thr Asp Leu Glu Glu Glu Arg Glu Lys
115 120 125

Leu Leu Asn Lys Leu Asp Lys Val Glu Glu Glu His Lys Lys Asp His
130 135 140

Glu Gln Gly Thr Asn Ser Lys Asn Pro Ala Pro Ala Pro Ala Ser Ala
145 150 155 160

Val Pro Val Lys Lys Glu Ala Thr Lys Leu Ser Glu Ala Glu Leu Tyr
165 170 175

Asn Lys Ile Gln Glu Leu Glu Glu Gly Lys Ala Glu Leu Phe Gly Ser
180 185 190

Glu Glu Glu Arg Thr Phe Thr Glu Leu Pro Tyr Glu Ala Arg Tyr Lys
195 200 205

Ala Trp Lys Ser Glu Asn Asp Glu Leu Arg Glu Asn Tyr Arg Arg Thr
210 215 220

Leu Asp Lys Phe Asn Thr Glu Gln Gly Lys Thr Thr Arg Leu Glu Glu
225 230 235 240

Gln Asn Lys Leu Ala Asp Ala Asn Ser Lys Ser Val Ser Asn Ser Asn
245 250 255

Val Ser Ile Asn Leu Tyr Asn Glu Leu Gln Ala Glu His Asp Lys Leu
260 265 270

Gln Thr Lys His Glu Glu Leu Leu Ala Glu His Asp Ala Leu Lys Glu
275 280 285

Lys Gln Asp Lys Asn Gln Glu Phe Asp Asp Arg Ser Val Ser Thr Asn
290 295 300

Ser Gly Ser Val Ser Thr Pro Tyr Asn Asn Leu Leu Asn Glu Tyr Asp
305 310 315 320

Asp Leu Leu Ala Lys His Gly Glu Leu Leu Ser Glu Tyr Asp Ala Leu
325 330 335

Lys Glu Lys Gln Asp Lys Asn Gln Glu Ala Ser Ser Asp Asn Ile Asn
340 345 350

Arg Ser Val Ser Val Lys Asp Asn Glu Lys Glu Leu His Asn Lys Ile
355 360 365

Ala Asp Leu Glu Glu Glu Arg Gly Glu His Leu Asp Lys Ile Asp Glu
370 375 380

Leu Lys Glu Glu Leu Lys Ala Lys Glu Lys Ser Ser His His His His
385 390 395 400

His His Cys



<210> 15
<211> 1290
<212> DNA

<213> Artificial Sequence

<220>

<221> CDS

<222> (1)..(1287)

<220>

<223> Description of Artificial Sequence: Hybrid of
Group A Streptococci M protein and M-like protein

<400> 15

atg aac ggt gat ggt aat cct agg gaa gtt ata gaa gat ctt gca gca 48
Met Asn Gly Asp Gly Asn Pro Arg Glu Val Ile Glu Asp Leu Ala Ala
1 5 10 15

aac aat ccc gca ata caa aat ata cgt tta cgt cac gaa aac aag gac 96
Asn Asn Pro Ala Ile Gln Asn Ile Arg Leu Arg His Glu Asn Lys Asp
20 25 30

tta aaa gcg aga tta gag aat gca atg gaa gtt gca gga aga gat ttt 144
Leu Lys Ala Arg Leu Glu Asn Ala Met Glu Val Ala Gly Arg Asp Phe
35 40 45

aag aga gct ctc gac gat cat agt gat tta gtc gca gaa aaa caa cgt 192
Lys Arg Ala Leu Asp Asp His Ser Asp Leu Val Ala Glu Lys Gln Arg
50 55 60

tta gaa gat tta gga caa aaa ttt gaa aga ctg aaa cag cgt tca gaa 240
Leu Glu Asp Leu Gly Gln Lys Phe Glu Arg Leu Lys Gln Arg Ser Glu
65 70 75 80

ctc tac ctt cag caa tac tat gat aat aaa tca aat gga tat aaa ggt 288
Leu Tyr Leu Gln Gln Tyr Tyr Asp Asn Lys Ser Asn Gly Tyr Lys Gly
85 90 95

gac tgg tat gta caa cag tta gga tcc gat tca gta agt gga tta gag 336
Asp Trp Tyr Val Gln Gln Leu Gly Ser Asp Ser Val Ser Gly Leu Glu
100 105 110

gtg gca gac ccc tct gat agt aag aaa ctt att gaa tta ggt ttg gct 384
Val Ala Asp Pro Ser Asp Ser Lys Lys Leu Ile Glu Leu Gly Leu Ala
115 120 125

aaa tac ctt aat gat aaa tta ccc ttt aaa act aaa gaa gat tca gag 432
Lys Tyr Leu Asn Asp Lys Leu Pro Phe Lys Thr Lys Glu Asp Ser Glu
130 135 140

att tta tca gag tta cgt gat gta tta aaa aat ctg cag gag tct cca 480
Ile Leu Ser Glu Leu Arg Asp Val Leu Lys Asn Leu Gln Glu Ser Pro
145 150 155 160

aaa agt act gag act tct gct aat gga gct gat aaa tta gct gat gca 528
Lys Ser Thr Glu Thr Ser Ala Asn Gly Ala Asp Lys Leu Ala Asp Ala
165 170 175

tac aac aca ttg ctt act gaa cat gag aaa ctc aga gat gag tat tat 576
Tyr Asn Thr Leu Leu Thr Glu His Glu Lys Leu Arg Asp Glu Tyr Tyr
180 185 190

aca tta att gat gct aaa gaa gaa gaa cct agg tat aaa gca ttg ggt 624
Thr Leu Ile Asp Ala Lys Glu Glu Glu Pro Arg Tyr Lys Ala Leu Gly
195 200 205

acc ttg tta gat cag gtt aca caa tta tat act aaa cat aat agt aat 672
Thr Leu Leu Asp Gln Val Thr Gln Leu Tyr Thr Lys His Asn Ser Asn
210 215 220

tac caa caa tat aat gca caa gct ggc aga ctt gac ctg aga caa aag 720
Tyr Gln Gln Tyr Asn Ala Gln Ala Gly Arg Leu Asp Leu Arg Gln Lys
225 230 235 240

gct gaa tat cta aaa ggc ctt aat gat tgg gct gag agg ctg tta caa 768
Ala Glu Tyr Leu Lys Gly Leu Asn Asp Trp Ala Glu Arg Leu Leu Gln
245 250 255

gag tta aat ggt acc aac aat gat ggt agg tct agg gac gtt acg gaa 816
Glu Leu Asn Gly Thr Asn Asn Asp Gly Arg Ser Arg Asp Val Thr Glu
260 265 270

gag att gca gca aac aat acc aca gta caa aat ata cgt tta cgt aac 864
Glu Ile Ala Ala Asn Asn Thr Thr Val Gln Asn Ile Arg Leu Arg Asn
275 280 285

gaa aac aag aac tta aaa gcg aaa aac gag gac tta gaa gcg aga tta 912
Glu Asn Lys Asn Leu Lys Ala Lys Asn Glu Asp Leu Glu Ala Arg Leu
290 295 300

gag aat gca atg aat gtt gca gga aga gat ttt aag aga gct gaa ttc 960
Glu Asn Ala Met Asn Val Ala Gly Arg Asp Phe Lys Arg Ala Glu Phe
305 310 315 320

gca cct ctt act cgt gct aca gca gac aat aaa gac gaa tta ata aaa 1008
Ala Pro Leu Thr Arg Ala Thr Ala Asp Asn Lys Asp Glu Leu Ile Lys
325 330 335

aga gct aac ggt tat gag ata cag aac cat cag tta aca gtt gag aat 1056
Arg Ala Asn Gly Tyr Glu Ile Gln Asn His Gln Leu Thr Val Glu Asn
340 345 350

aaa aaa tta aaa act gat aag gaa cag tta aca aaa gag aat gat gat 1104
Lys Lys Leu Lys Thr Asp Lys Glu Gln Leu Thr Lys Glu Asn Asp Asp
355 360 365

tta aaa cac gtg aac ggt gat ggt aat cct agg gaa gtt ata gaa gat 1152
Leu Lys His Val Asn Gly Asp Gly Asn Pro Arg Glu Val Ile Glu Asp
370 375 380

ctt gca gca aac aat ccc gca ata caa aat ata cgt tta cgt cac gaa 1200
Leu Ala Ala Asn Asn Pro Ala Ile Gln Asn Ile Arg Leu Arg His Glu
385 390 395 400

aac aag gac tta aaa gcg aga tta gag aat gca atg gaa gtt gca gga 1248
Asn Lys Asp Leu Lys Ala Arg Leu Glu Asn Ala Met Glu Val Ala Gly
405 410 415

aga gat ttt aag aga gct cac cac cac cac cac cac tgc taa 1290
Arg Asp Phe Lys Arg Ala His His His His His His Cys
420 425



<210> 16
<211> 429
<212> PRT

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Hybrid of Group A Streptococci M
protein and M-like protein

<400> 16

Met Asn Gly Asp Gly Asn Pro Arg Glu Val Ile Glu Asp Leu Ala Ala
1 5 10 15

Asn Asn Pro Ala Ile Gln Asn Ile Arg Leu Arg His Glu Asn Lys Asp
20 25 30

Leu Lys Ala Arg Leu Glu Asn Ala Met Glu Val Ala Gly Arg Asp Phe
35 40 45

Lys Arg Ala Leu Asp Asp His Ser Asp Leu Val Ala Glu Lys Gln Arg
50 55 60

Leu Glu Asp Leu Gly Gln Lys Phe Glu Arg Leu Lys Gln Arg Ser Glu
65 70 75 80

Leu Tyr Leu Gln Gln Tyr Tyr Asp Asn Lys Ser Asn Gly Tyr Lys Gly
85 90 95

Asp Trp Tyr Val Gln Gln Leu Gly Ser Asp Ser Val Ser Gly Leu Glu
100 105 110

Val Ala Asp Pro Ser Asp Ser Lys Lys Leu Ile Glu Leu Gly Leu Ala
115 120 125

Lys Tyr Leu Asn Asp Lys Leu Pro Phe Lys Thr Lys Glu Asp Ser Glu
130 135 140

Ile Leu Ser Glu Leu Arg Asp Val Leu Lys Asn Leu Gln Glu Ser Pro
145 150 155 160

Lys Ser Thr Glu Thr Ser Ala Asn Gly Ala Asp Lys Leu Ala Asp Ala
165 170 175

Tyr Asn Thr Leu Leu Thr Glu His Glu Lys Leu Arg Asp Glu Tyr Tyr
180 185 190

Thr Leu Ile Asp Ala Lys Glu Glu Glu Pro Arg Tyr Lys Ala Leu Gly
195 200 205

Thr Leu Leu Asp Gln Val Thr Gln Leu Tyr Thr Lys His Asn Ser Asn
210 215 220

Tyr Gln Gln Tyr Asn Ala Gln Ala Gly Arg Leu Asp Leu Arg Gln Lys
225 230 235 240

Ala Glu Tyr Leu Lys Gly Leu Asn Asp Trp Ala Glu Arg Leu Leu Gln
245 250 255

Glu Leu Asn Gly Thr Asn Asn Asp Gly Arg Ser Arg Asp Val Thr Glu
260 265 270

Glu Ile Ala Ala Asn Asn Thr Thr Val Gln Asn Ile Arg Leu Arg Asn
275 280 285

Glu Asn Lys Asn Leu Lys Ala Lys Asn Glu Asp Leu Glu Ala Arg Leu
290 295 300

Glu Asn Ala Met Asn Val Ala Gly Arg Asp Phe Lys Arg Ala Glu Phe
305 310 315 320

Ala Pro Leu Thr Arg Ala Thr Ala Asp Asn Lys Asp Glu Leu Ile Lys
325 330 335

Arg Ala Asn Gly Tyr Glu Ile Gln Asn His Gln Leu Thr Val Glu Asn
340 345 350

Lys Lys Leu Lys Thr Asp Lys Glu Gln Leu Thr Lys Glu Asn Asp Asp
355 360 365

Leu Lys His Val Asn Gly Asp Gly Asn Pro Arg Glu Val Ile Glu Asp
370 375 380

Leu Ala Ala Asn Asn Pro Ala Ile Gln Asn Ile Arg Leu Arg His Glu
385 390 395 400

Asn Lys Asp Leu Lys Ala Arg Leu Glu Asn Ala Met Glu Val Ala Gly
405 410 415

Arg Asp Phe Lys Arg Ala His His His His His His Cys
420 425



<210> 17 
<211> 5347 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Vector 
<400> 17 

tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540
tccgctcatg aattaattct tagaaaaact catcgagcat caaatgaaac tgcaatttat 600
tcatatcagg attatcaata ccatattttt gaaaaagccg tttctgtaat gaaggagaaa 660
actcaccgag gcagttccat aggatggcaa gatcctggta tcggtctgcg attccgactc 720
gtccaacatc aatacaacct attaatttcc cctcgtcaaa aataaggtta tcaagtgaga 780
aatcaccatg agtgacgact gaatccggtg agaatggcaa aagtttatgc atttctttcc 840
agacttgttc aacaggccag ccattacgct cgtcatcaaa atcactcgca tcaaccaaac 900
cgttattcat tcgtgattgc gcctgagcga gacgaaatac gcgatcgctg ttaaaaggac 960
aattacaaac aggaatcgaa tgcaaccggc gcaggaacac tgccagcgca tcaacaatat 1020
tttcacctga atcaggatat tcttctaata cctggaatgc tgttttcccg gggatcgcag 1080
tggtgagtaa ccatgcatca tcaggagtac ggataaaatg cttgatggtc ggaagaggca 1140
taaattccgt cagccagttt agtctgacca tctcatctgt aacatcattg gcaacgctac 1200
ctttgccatg tttcagaaac aactctggcg catcgggctt cccatacaat cgatagattg 1260
tcgcacctga ttgcccgaca ttatcgcgag cccatttata cccatataaa tcagcatcca 1320
tgttggaatt taatcgcggc ctagagcaag acgtttcccg ttgaatatgg ctcataacac 1380
cccttgtatt actgtttatg taagcagaca gttttattgt tcatgaccaa aatcccttaa 1440
cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga 1500
gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 1560
gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc 1620
agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag 1680
aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc 1740
agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg 1800
cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac 1860
accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga 1920
aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt 1980
ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag 2040
cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg 2100
gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta 2160
tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc 2220
agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cctgatgcgg 2280
tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatatggtgc actctcagta 2340
caatctgctc tgatgccgca tagttaagcc agtatacact ccgctatcgc tacgtgactg 2400
ggtcatggct gcgccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct 2460
gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag 2520
gttttcaccg tcatcaccga aacgcgcgag gcagctgcgg taaagctcat cagcgtggtc 2580
gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag 2640
aagcgttaat gtctggcttc tgataaagcg ggccatgtta agggcggttt tttcctgttt 2700
ggtcactgat gcctccgtgt aagggggatt tctgttcatg ggggtaatga taccgatgaa 2760
acgagagagg atgctcacga tacgggttac tgatgatgaa catgcccggt tactggaacg 2820
ttgtgagggt aaacaactgg cggtatggat gcggcgggac cagagaaaaa tcactcaggg 2880
tcaatgccag cgcttcgtta atacagatgt aggtgttcca cagggtagcc agcagcatcc 2940
tgcgatgcag atccggaaca taatggtgca gggcgctgac ttccgcgttt ccagacttta 3000
cgaaacacgg aaaccgaaga ccattcatgt tgttgctcag gtcgcagacg ttttgcagca 3060
gcagtcgctt cacgttcgct cgcgtatcgg tgattcattc tgctaaccag taaggcaacc 3120
ccgccagcct agccgggtcc tcaacgacag gagcacgatc atgcgcaccc gtggggccgc 3180
catgccggcg ataatggcct gcttctcgcc gaaacgtttg gtggcgggac cagtgacgaa 3240
ggcttgagcg agggcgtgca agattccgaa taccgcaagc gacaggccga tcatcgtcgc 3300
gctccagcga aagcggtcct cgccgaaaat gacccagagc gctgccggca cctgtcctac 3360
gagttgcatg ataaagaaga cagtcataag tgcggcgacg atagtcatgc cccgcgccca 3420
ccggaaggag ctgactgggt tgaaggctct caagggcatc ggtcgagatc ccggtgccta 3480
atgagtgagc taacttacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa 3540
cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat 3600
tgggcgccag ggtggttttt cttttcacca gtgagacggg caacagctga ttgcccttca 3660
ccgcctggcc ctgagagagt tgcagcaagc ggtccacgct ggtttgcccc agcaggcgaa 3720
aatcctgttt gatggtggtt aacggcggga tataacatga gctgtcttcg gtatcgtcgt 3780
atcccactac cgagatatcc gcaccaacgc gcagcccgga ctcggtaatg gcgcgcattg 3840
cgcccagcgc catctgatcg ttggcaacca gcatcgcagt gggaacgatg ccctcattca 3900
gcatttgcat ggtttgttga aaaccggaca tggcactcca gtcgccttcc cgttccgcta 3960
tcggctgaat ttgattgcga gtgagatatt tatgccagcc agccagacgc agacgcgccg 4020
agacagaact taatgggccc gctaacagcg cgatttgctg gtgacccaat gcgaccagat 4080
gctccacgcc cagtcgcgta ccgtcttcat gggagaaaat aatactgttg atgggtgtct 4140
ggtcagagac atcaagaaat aacgccggaa cattagtgca ggcagcttcc acagcaatgg 4200
catcctggtc atccagcgga tagttaatga tcagcccact gacgcgttgc gcgagaagat 4260
tgtgcaccgc cgctttacag gcttcgacgc cgcttcgttc taccatcgac accaccacgc 4320
tggcacccag ttgatcggcg cgagatttaa tcgccgcgac aatttgcgac ggcgcgtgca 4380
gggccagact ggaggtggca acgccaatca gcaacgactg tttgcccgcc agttgttgtg 4440
ccacgcggtt gggaatgtaa ttcagctccg ccatcgccgc ttccactttt tcccgcgttt 4500
tcgcagaaac gtggctggcc tggttcacca cgcgggaaac ggtctgataa gagacaccgg 4560
catactctgc gacatcgtat aacgttactg gtttcacatt caccaccctg aattgactct 4620
cttccgggcg ctatcatgcc ataccgcgaa aggttttgcg ccattcgatg gtgtccggga 4680
tctcgacgct ctcccttatg cgactcctgc attaggaagc agcccagtag taggttgagg 4740
ccgttgagca ccgccgccgc aaggaatggt gcatgcaagg agatggcgcc caacagtccc 4800
ccggccacgg ggcctgccac catacccacg ccgaaacaag cgctcatgag cccgaagtgg 4860
cgagcccgat cttccccatc ggtgatgtcg gcgatatagg cgccagcaac cgcacctgtg 4920
gcgccggtga tgccggccac gatgcgtccg gcgtagagga tcgagatcta atcataaaaa 4980
atttatttgc tttgtgagcg gataacaatt ataatagatt caattgtgag cggataacaa 5040
ttataataga ttcaattcta aatttacaag aatttcacac agaattcatt aaagaggaga 5100
aattacatat ggctagcatg actggtggac agcaaatggg tcgcggatcc gaattcgagc 5160
tccgtcgaca agcttgcggc cgcactcgag caccaccacc accaccactg agatccggct 5220
gctaacaaag cccgaaagga agctgagttg gctgctgcca ccgctgagca ataactagca 5280
taaccccttg gggcctctaa acgggtcttg aggggttttt tgctgaaagg aggaactata 5340
tccggat 5347



<210> 18 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Carboxy terminal tag sequence 
<400> 18 

Asp Tyr Lys Asp Asp Asp Asp Lys 
1 5 

<210> 19 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Carboxy terminal tag sequence 
<400> 19 

Asp Leu Tyr Asp Asp Asp Asp Lys 
1 5